PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 9/02, 1/20, 15/09, 15/63, C12P 
13/00 



A1

(11) International Publication Number: WO 98/18910

(43) International Publication Date: 7 May 1998 (07.05.98)



(21) International Application Number: PCT/US97/17819

(22) International Filing Date: 3 October 1997 (03.10.97) 



(30) Priority Data: 08/742,605, 28 October 1996 (28.10.96), US



(71) Applicant (for all designated States except US): YISSUM
RESEARCH AND DEVELOPMENT COMPANY OF
THE HEBREW UNIVERSITY OF JERUSALEM [IL/IL];
Jabotinsky 46, 91042 Jerusalem (IL).

(71) Applicant (for TJ only): FRIEDMAN, Mark [US/IL]; Alharizi
1, 43406 Raanana (IL).

(72) Inventors; and

(75) Inventors/Applicants (for US only): MANN, Varda [IL/IL];
Nof Harim 23, 96190 Mevasseret Zion (IL). HIRSCHBERG,
Joseph [IL/IL]; Borla 26, 93714 Jerusalem (IL). LOTAN,
Tamar [IL/IL]; 15105 Kineret (IL). HARKER, Mark [IL/IL];
Narkis 9, 92461 Jerusalem (IL).

(74) Agent: CASTORINA, Anthony; Mark Friedman Ltd., Suite 
207, 2001 Jefferson Davis Highway, Arlington, VA 22202 
(US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, 
PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, 
TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO patent (GH, 
KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent (AM, AZ,
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, 
CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, 
ML, MR, NE, SN, TD, TG).



Published 

With international search report. 



(54) Title: NUCLEIC ACID SEQUENCE ENCODING BETA-C-4-OXYGENASE FROM HAEMATOCOCCUS PLUVIALIS FOR THE
BIOSYNTHESIS OF ASTAXANTHIN



(57) Abstract 



The present invention relates, in general, to a biotechnological method for production of (3S,3'S) astaxanthin. In particular, the
present invention relates to a peptide having a β-C-4-oxygenase activity; a DNA segment coding for this peptide; an RNA segment coding
for this peptide; a recombinant DNA molecule comprising a vector and the DNA segment; a host cell or organism containing the above
described recombinant DNA molecule or DNA segment; and to a method of biotechnologically producing (3S,3'S) astaxanthin or a food
additive containing (3S,3'S) astaxanthin, using the host.



NUCLEIC ACID SEQUENCE ENCODING BETA-C-4-OXYGENASE FROM HAEMATOCOCCUS PLUVIALIS FOR THE 
BIOSYNTHESIS OF ASTAXANTHIN 

FIELD AND BACKGROUND OF THE INVENTION

The present invention relates, in general, to a biotechnological method for
production of (3S,3'S) astaxanthin. In particular, the present invention relates to a
peptide having a β-C-4-oxygenase activity; a DNA segment coding for this
peptide; an RNA segment coding for this peptide; a recombinant DNA molecule
comprising a vector and the DNA segment; a host cell or organism containing the
above described recombinant DNA molecule or DNA segment; and to a method of
biotechnologically producing (3S,3'S) astaxanthin or a food additive containing
(3S,3'S) astaxanthin, using the host.

Carotenoids, such as astaxanthin, are natural pigments that are responsible 
for many of the yellow, orange and red colors seen in living organisms. 
Carotenoids are widely distributed in nature and have, in various living systems, 
two main biological functions: they serve as light-harvesting pigments in
photosynthesis, and they protect against photooxidative damage. These and
additional biological functions of carotenoids, their important industrial role, and 
their biosynthesis are discussed hereinbelow. 

As part of the light-harvesting antenna, carotenoids can absorb photons and
transfer the energy to chlorophyll, thus assisting in the harvesting of light in the
range of 450 - 570 nm [see, Cogdell RJ and Frank HA (1987) How carotenoids
function in photosynthetic bacteria. Biochim Biophys Acta 895: 63-79; Cogdell R
(1988) The function of pigments in chloroplasts. In: Goodwin TW (ed) Plant
Pigments, pp 183-255. Academic Press, London; Frank HA, Violette CA,
Trautman JK, Shreve AP, Owens TG and Albrecht AC (1991) Carotenoids in
photosynthesis: structure and photochemistry. Pure Appl Chem 63: 109-114; Frank
HA, Farhoosh R, Decoster B and Christensen RL (1992) Molecular features that
control the efficiency of carotenoid-to-chlorophyll energy transfer in
photosynthesis. In: Murata N (ed) Research in Photosynthesis, Vol I, pp 125-128.
Kluwer, Dordrecht; and, Cogdell RJ and Gardiner AT (1993) Functions of
carotenoids in photosynthesis. Meth Enzymol 214: 185-193]. Although
carotenoids are integral constituents of the protein-pigment complexes of the
light-harvesting antennae in photosynthetic organisms, they are also important
components of the photosynthetic reaction centers.
Most of the total carotenoid content is located in the light-harvesting complex II
[Bassi R, Pineau B, Dainese P and Marquardt J (1993) Carotenoid binding proteins
of photosystem II. Eur J Biochem 212: 297-302]. The identities of the
photosynthetically active carotenoproteins and their precise location in
light-harvesting systems are not known. Carotenoids in photochemically active
chlorophyll-protein complexes of the thermophilic cyanobacterium Synechococcus
sp. were investigated by linear dichroism spectroscopy of oriented samples [see,
Breton J and Kato S (1987) Orientation of the pigments in photosystem II:
low-temperature linear-dichroism study of a core particle and of its chlorophyll-protein
subunits isolated from Synechococcus sp. Biochim Biophys Acta 892: 99-107].
These complexes contained mainly a β-carotene pool absorbing around 505 and
470 nm, which is oriented close to the membrane plane. In photochemically
inactive chlorophyll-protein complexes, the β-carotene absorbs around 495 and
465 nm, and the molecules are oriented perpendicular to the membrane plane.

Evidence that carotenoids are associated with cyanobacterial photosystem
(PS) II has been described [see, Suzuki R and Fujita Y (1977) Carotenoid
photobleaching induced by the action of photosynthetic reaction center II: DCMU
sensitivity. Plant Cell Physiol 18: 625-631; and, Newman PJ and Sherman LA
(1978) Isolation and characterization of photosystem I and II membrane particles
from the blue-green alga Synechococcus cedrorum. Biochim Biophys Acta 503:
343-361]. There are two β-carotene molecules in the reaction center core of PS II
[see, Ohno T, Satoh K and Katoh S (1986) Chemical composition of purified
oxygen-evolving complexes from the thermophilic cyanobacterium Synechococcus
sp. Biochim Biophys Acta 852: 1-8; Gounaris K, Chapman DJ and Barber J (1989)
Isolation and characterization of a D1/D2/cytochrome b-559 complex from
Synechocystis PCC6803. Biochim Biophys Acta 973: 296-301; and, Newell RW,
van Amerongen H, Barber J and van Grondelle R (1993) Spectroscopic
characterization of the reaction center of photosystem II using polarized light:
Evidence for β-carotene excitons in PS II reaction centers. Biochim Biophys Acta
1057: 232-238] whose exact function(s) is still obscure [reviewed by Satoh K
(1992) Structure and function of PS II reaction center. In: Murata N (ed) Research
in Photosynthesis, Vol. II, pp. 3-12. Kluwer, Dordrecht]. It was demonstrated that
these two coupled β-carotene molecules protect chlorophyll P680 from
photodamage in isolated PS II reaction centers [see, De Las Rivas J, Telfer A and
Barber J (1993) 2-coupled β-carotene molecules protect P680 from photodamage
in isolated PS II reaction centers. Biochim. Biophys. Acta 1142: 155-164], and this
may be related to the protection against degradation of the D1 subunit of PS II
[see, Sandmann G (1993) Genes and enzymes involved in the desaturation
reactions from phytoene to lycopene. (abstract), 10th International Symposium on
Carotenoids, Trondheim CL1-2]. The light-harvesting pigments of a highly
purified, oxygen-evolving PS II complex of the thermophilic cyanobacterium
Synechococcus sp. consist of 50 chlorophyll a and 7 β-carotene, but no
xanthophyll, molecules [see, Ohno T, Satoh K and Katoh S (1986) Chemical
composition of purified oxygen-evolving complexes from the thermophilic
cyanobacterium Synechococcus sp. Biochim Biophys Acta 852: 1-8]. β-carotene
was shown to play a role in the assembly of an active PS II in green algae [see,
Humbeck K, Romer S and Senger H (1989) Evidence for the essential role of
carotenoids in the assembly of an active PS II. Planta 179: 242-250].

Isolated complexes of PS I from Phormidium luridum, which contained 40
chlorophylls per P700, contained an average of 1.3 molecules of β-carotene [see,
Thornber JP, Alberte RS, Hunter FA, Shiozawa JA and Kan KS (1976) The
organization of chlorophyll in the plant photosynthetic unit. Brookhaven Symp
Biology 28: 132-148]. In a preparation of PS I particles from Synechococcus sp.
strain PCC 6301, which contained 130 ± 5 molecules of antenna chlorophylls per
P700, 16 molecules of carotenoids were detected [see, Lundell DJ, Glazer AN,
Melis A and Malkin R (1985) Characterization of a cyanobacterial photosystem I
complex. J Biol Chem 260: 646-654]. A substantial content of β-carotene and the
xanthophylls cryptoxanthin and isocryptoxanthin were detected in PS I
pigment-protein complexes of the thermophilic cyanobacterium Synechococcus elongatus
[see, Coufal J, Hladik J and Sofrova D (1989) The carotenoid content of
photosystem I pigment-protein complexes of the cyanobacterium Synechococcus
elongatus. Photosynthetica 23: 603-616]. A subunit protein-complex structure of
PS I from the thermophilic cyanobacterium Synechococcus sp., which consisted of
four polypeptides (of 62, 60, 14 and 10 kDa), contained approximately 10
β-carotene molecules per P700 [see, Takahashi Y, Hirota K and Katoh S (1985)
Multiple forms of P700-chlorophyll a-protein complexes from Synechococcus sp.:
the iron, quinone and carotenoid contents. Photosynth Res 6: 183-192]. This
carotenoid is exclusively bound to the large polypeptides which carry the
functional and antenna chlorophyll a. The fluorescence excitation spectrum of
these complexes suggested that β-carotene serves as an efficient antenna for PS I.

As mentioned, an additional essential function of carotenoids is to protect
against photooxidation processes in the photosynthetic apparatus that are caused
by the excited triplet state of chlorophyll. Carotenoid molecules with π-electron
conjugation of nine or more carbon-carbon double bonds can absorb triplet-state
energy from chlorophyll and thus prevent the formation of harmful singlet-state
oxygen radicals. In Synechococcus sp. the triplet state of carotenoids was
monitored in closed PS II centers and its rise kinetics of approximately 25
nanoseconds is attributed to energy transfer from chlorophyll triplets in the antenna
[see, Schlodder E and Brettel K (1988) Primary charge separation in closed
photosystem II with a lifetime of 11 nanoseconds. Flash-absorption spectroscopy
with oxygen-evolving photosystem II complexes from Synechococcus. Biochim
Biophys Acta 933: 22-34]. It is conceivable that this process, which has a lower
yield compared to the yield of radical-pair formation, plays a role in protecting
chlorophyll from damage due to over-excitation.

The protective role of carotenoids in vivo has been elucidated through the
use of bleaching herbicides such as norflurazon that inhibit carotenoid biosynthesis
in all organisms performing oxygenic photosynthesis [reviewed by Sandmann G
and Boger P (1989) Inhibition of carotenoid biosynthesis by herbicides. In: Boger
P and Sandmann G (Eds.) Target Sites of Herbicide Action, pp 25-44. CRC Press,
Boca Raton, Florida]. Treatment with norflurazon in the light results in a decrease
of both carotenoid and chlorophyll levels, while in the dark, chlorophyll levels are
unaffected. Inhibition of photosynthetic efficiency in cells of Oscillatoria agardhii
that were treated with the pyridinone herbicide, fluridone, was attributed to a
decrease in the relative abundance of myxoxanthophyll, zeaxanthin and
β-carotene, which in turn caused photooxidation of chlorophyll molecules [see,
Canto de Loura I, Dubacq JP and Thomas JC (1987) The effects of nitrogen
deficiency on pigments and lipids of cyanobacteria. Plant Physiol 83: 838-843].

It has been demonstrated in plants that zeaxanthin is required to dissipate, in
a nonradiative manner, the excess excitation energy of the antenna chlorophyll
[see, Demmig-Adams B (1990) Carotenoids and photoprotection in plants: a role
for the xanthophyll zeaxanthin. Biochim Biophys Acta 1020: 1-24; and,
Demmig-Adams B and Adams WW III (1990) The carotenoid zeaxanthin and
high-energy-state quenching of chlorophyll fluorescence. Photosynth Res 25: 187-197]. In
algae and plants a light-induced deepoxidation of violaxanthin to yield zeaxanthin
is related to photoprotection processes [reviewed by Demmig-Adams B and
Adams WW III (1992) Photoprotection and other responses of plants to high light
stress. Ann Rev Plant Physiol Plant Mol Biol 43: 599-626]. The light-induced
deepoxidation of violaxanthin and the reverse reaction that takes place in the dark
are known as the "xanthophyll cycle" [see, Demmig-Adams B and Adams WW III
(1992) Photoprotection and other responses of plants to high light stress. Ann Rev
Plant Physiol Plant Mol Biol 43: 599-626]. Cyanobacterial lichens, which do not
contain any zeaxanthin and which probably are incapable of radiationless energy
dissipation, are sensitive to high light intensity; algal lichens that contain
zeaxanthin are more resistant to high-light stress [see, Demmig-Adams B, Adams
WW III, Green TGA, Czygan FC and Lange OL (1990) Differences in the
susceptibility to light stress in two lichens forming a phycosymbiodeme, one
partner possessing and one lacking the xanthophyll cycle. Oecologia 84: 451-456;
Demmig-Adams B and Adams WW III (1993) The xanthophyll cycle, protein
turnover, and the high light tolerance of sun-acclimated leaves. Plant Physiol 103:
1413-1420; and, Demmig-Adams B (1990) Carotenoids and photoprotection in
plants: a role for the xanthophyll zeaxanthin. Biochim Biophys Acta 1020: 1-24].
In contrast to algae and plants, cyanobacteria do not have a xanthophyll cycle.
However, they do contain ample quantities of zeaxanthin and other xanthophylls
that can support photoprotection of chlorophyll.

Several other functions have been ascribed to carotenoids. The possibility
that carotenoids protect against damaging species generated by near ultra-violet
(UV) irradiation is suggested by results describing the accumulation of β-carotene
in a UV-resistant mutant of the cyanobacterium Gloeocapsa alpicola [see, Buckley
CE and Houghton JA (1976) A study of the effects of near UV radiation on the
pigmentation of the blue-green alga Gloeocapsa alpicola. Arch Microbiol 107:
93-97]. This has been demonstrated more elegantly in Escherichia coli cells that
produce carotenoids [see, Tuveson RW and Sandmann G (1993) Protection by
cloned carotenoid genes expressed in Escherichia coli against phototoxic
molecules activated by near-ultraviolet light. Meth Enzymol 214: 323-330]. Due
to their ability to quench oxygen radical species, carotenoids are efficient
antioxidants and thereby protect cells from oxidative damage. This function of
carotenoids is important in virtually all organisms [see, Krinsky NI (1989)
Antioxidant functions of carotenoids. Free Radical Biol Med 7: 617-635; and,
Palozza P and Krinsky NI (1992) Antioxidant effects of carotenoids in vivo and in
vitro - an overview. Meth Enzymol 213: 403-420]. Other cellular functions could
be affected by carotenoids, even if indirectly. Although carotenoids in
cyanobacteria are not the major photoreceptors for phototaxis, an influence of
carotenoids on phototactic reactions, which have been observed in Anabaena
variabilis, was attributed to the removal of singlet oxygen radicals that may act as
signal intermediates in this system [see, Nultsch W and Schuchart H (1985) A
model of the phototactic reaction chain of cyanobacterium Anabaena variabilis.
Arch Microbiol 142: 180-184].

In flowers and fruits carotenoids facilitate the attraction of pollinators and
dispersal of seeds. This latter aspect is strongly associated with agriculture. The
type and degree of pigmentation in fruits and flowers are among the most
important traits of many crops. This is mainly because the colors of these products
often determine their appeal to consumers and thus can increase their market
worth.

Carotenoids have important commercial uses as coloring agents in the food
industry since they are non-toxic [see, Bauernfeind JC (1981) Carotenoids as
colorants and vitamin A precursors. Academic Press, London]. The red color of
the tomato fruit is provided by lycopene, which accumulates during fruit ripening
in chromoplasts. Tomato extracts, which have a high content (over 80% dry
weight) of lycopene, are commercially produced worldwide for industrial use as
a food colorant. Furthermore, the flesh, feathers or eggs of fish and birds assume the
color of the dietary carotenoid provided, and thus carotenoids are frequently used
in dietary additives for poultry and in aquaculture. Certain cyanobacterial species,
for example Spirulina sp. [see, Sommer TR, Potts WT and Morrissy NM (1990)
Recent progress in processed microalgae in aquaculture. Hydrobiologia 204/205:
435-443], are cultivated in aquaculture for the production of animal and human
food supplements. Consequently, the content of carotenoids, primarily of
β-carotene, in these cyanobacteria has a major commercial implication in
biotechnology.

Most carotenoids are composed of a C40 hydrocarbon backbone,
constructed from eight C5 isoprenoid units, and contain a series of conjugated
double bonds. Carotenes do not contain oxygen atoms and are either linear or
cyclized molecules containing one or two end rings. Xanthophylls are oxygenated
derivatives of carotenes. Various glycosylated carotenoids and carotenoid esters
have been identified. The C40 backbone can be further extended to give C45 or
C50 carotenoids, or shortened yielding apocarotenoids. Some nonphotosynthetic
bacteria also synthesize C30 carotenoids. General background on carotenoids can
be found in Goodwin TW (1980) The Biochemistry of the Carotenoids, Vol. 1, 2nd
Ed. Chapman and Hall, New York; and in Goodwin TW and Britton G (1988)
Distribution and analysis of carotenoids. In: Goodwin TW (ed) Plant Pigments, pp
62-132. Academic Press, New York.
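
The carbon arithmetic in the paragraph above can be checked with a minimal sketch. The function names `backbone_carbons` and `units_for` are hypothetical and introduced only for illustration; the text itself states only that a C40 backbone is built from eight C5 isoprenoid units, so the unit counts obtained for C30, C45 and C50 backbones are simple division by five rather than figures from the source.

```python
# Illustrative arithmetic only: carbon count of an isoprenoid-derived backbone.
C5_UNIT = 5  # carbons per isoprenoid unit, as stated in the text


def backbone_carbons(n_units: int) -> int:
    """Return the number of backbone carbons for n_units C5 isoprenoid units."""
    if n_units < 1:
        raise ValueError("need at least one isoprenoid unit")
    return n_units * C5_UNIT


def units_for(carbons: int) -> int:
    """Return how many C5 units a backbone of the given size corresponds to."""
    if carbons <= 0 or carbons % C5_UNIT != 0:
        raise ValueError("not a positive whole number of C5 units")
    return carbons // C5_UNIT


if __name__ == "__main__":
    # Backbone sizes mentioned in the text: C30, C40, C45 and C50.
    for size in (30, 40, 45, 50):
        print(f"C{size}: {units_for(size)} isoprenoid units")
```

Under these assumptions, the eight-unit case reproduces the C40 backbone described above, and the other sizes differ from it by whole C5 units.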

More than 640 different naturally occurring carotenoids have been characterized
so far; hence, carotenoids are responsible for most of the various shades of
yellow, orange and red found in microorganisms, fungi, algae, plants and animals.
Carotenoids are synthesized by all photosynthetic organisms as well as several
nonphotosynthetic bacteria and fungi; however, they are also widely distributed
through feeding throughout the animal kingdom.

Carotenoids are synthesized de novo from isoprenoid precursors only in
photosynthetic organisms and some microorganisms; they typically accumulate in
protein complexes in the photosynthetic membrane, in the cell membrane and in
the cell wall.

As detailed in Figure 1, in the biosynthesis pathway of β-carotene, four
enzymes convert geranylgeranyl pyrophosphate of the central isoprenoid pathway
to β-carotene. Carotenoids are produced from the general isoprenoid biosynthetic
pathway. While this pathway has been known for several decades, only recently,
and mainly through the use of genetics and molecular biology, have some of the
molecular mechanisms involved in carotenoid biogenesis been elucidated. This
is due to the fact that most of the enzymes which take part in the conversion of
phytoene to carotenes and xanthophylls are labile, membrane-associated proteins
that lose activity upon solubilization [see, Beyer P, Weiss G and Kleinig H (1985)
Solubilization and reconstitution of the membrane-bound carotenogenic enzymes
from daffodil chromoplasts. Eur J Biochem 153: 341-346; and, Bramley PM
(1985) The in vitro biosynthesis of carotenoids. Adv Lipid Res 21: 243-279].
However, solubilization of carotenogenic enzymes from Synechocystis sp. strain
PCC 6714 that retain partial activity has been reported [see, Bramley PM and
Sandmann G (1987) Solubilization of carotenogenic enzyme of Aphanocapsa.
Phytochem 26: 1935-1939]. There is no genuine in vitro system for carotenoid
biosynthesis which enables a direct assay of enzymatic activities. A cell-free
carotenogenic system has been developed [see, Clarke IE, Sandmann G, Bramley
PM and Boger P (1982) Carotene biosynthesis with isolated photosynthetic
membranes. FEBS Lett 140: 203-206] and adapted for cyanobacteria [see,
Sandmann G and Bramley PM (1985) Carotenoid biosynthesis by Aphanocapsa
homogenates coupled to a phytoene-generating system from Phycomyces
blakesleeanus. Planta 164: 259-263; and, Bramley PM and Sandmann G (1985) In
vitro and in vivo biosynthesis of xanthophylls by the cyanobacterium
Aphanocapsa. Phytochem 24: 2919-2922]. Reconstitution of phytoene desaturase
from Synechococcus sp. strain PCC 7942 in liposomes was achieved following
purification of the polypeptide, which had been expressed in Escherichia coli [see,
Fraser PD, Linden H and Sandmann G (1993) Purification and reactivation of
recombinant Synechococcus phytoene desaturase from an overexpressing strain of
Escherichia coli. Biochem J 291: 687-692].

Referring now to Figure 1, carotenoids are synthesized from isoprenoid
precursors. The central pathway of isoprenoid biosynthesis may be viewed as
beginning with the conversion of acetyl-CoA to mevalonic acid. Δ3-isopentenyl
pyrophosphate (IPP), a C5 molecule, is formed from mevalonate and is the
building block for all long-chain isoprenoids. Following isomerization of IPP to
dimethylallyl pyrophosphate (DMAPP), three additional molecules of IPP are
combined to yield the C20 molecule, geranylgeranyl pyrophosphate (GGPP).
These 1-4 condensation reactions are catalyzed by prenyl transferases [see,
Kleinig H (1989) The role of plastids in isoprenoid biosynthesis. Ann Rev Plant
Physiol Plant Mol Biol 40: 39-59]. There is evidence in plants that the same
enzyme, GGPP synthase, carries out all the reactions from DMAPP to GGPP [see,
Dogbo O and Camara B (1987) Purification of isopentenyl pyrophosphate
isomerase and geranylgeranyl pyrophosphate synthase from Capsicum
chromoplasts by affinity chromatography. Biochim Biophys Acta 920: 140-148;
and, Laferriere A and Beyer P (1991) Purification of geranylgeranyl diphosphate
synthase from Sinapis alba etioplasts. Biochim Biophys Acta 216: 156-163].
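
The chain elongation described above, in which DMAPP (C5) is extended by three successive IPP (C5) additions to give GGPP (C20), can be followed with a short sketch. The function name `elongate` and its parameters are hypothetical; the sketch tracks carbon counts only and says nothing about the enzymology.

```python
from typing import List


def elongate(start_carbons: int = 5,
             ipp_additions: int = 3,
             ipp_carbons: int = 5) -> List[int]:
    """Return the carbon count after each condensation step.

    Starts from DMAPP (C5 by default) and adds one IPP unit (C5) per step,
    mirroring the three additions the text describes on the way to GGPP.
    """
    if ipp_additions < 0:
        raise ValueError("number of additions cannot be negative")
    sizes = [start_carbons]
    for _ in range(ipp_additions):
        # Each condensation extends the chain by one C5 unit.
        sizes.append(sizes[-1] + ipp_carbons)
    return sizes


if __name__ == "__main__":
    chain = elongate()
    print(" -> ".join(f"C{c}" for c in chain))
```

With the default arguments the last value is 20, matching the C20 size given for GGPP in the text.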

The first step that is specific for carotenoid biosynthesis is the head-to-head
condensation of two molecules of GGPP to produce prephytoene pyrophosphate
(PPPP). Following removal of the pyrophosphate, GGPP is converted to
15-cis-phytoene, a colorless C40 hydrocarbon molecule. This two-step reaction is
catalyzed by the soluble enzyme, phytoene synthase, an enzyme encoded by a
single gene (crtB), in both cyanobacteria and plants [see, Chamovitz D, Misawa N,
Sandmann G and Hirschberg J (1992) Molecular cloning and expression in
Escherichia coli of a cyanobacterial gene coding for phytoene synthase, a
carotenoid biosynthesis enzyme. FEBS Lett 296: 305-310; Ray JA, Bird CR,
Maunders M, Grierson D and Schuch W (1987) Sequence of pTOM5, a ripening
related cDNA from tomato. Nucl Acids Res 15: 10587-10588; Camara B (1993)
Plant phytoene synthase complex - component 3 enzymes, immunology, and
biogenesis. Meth Enzymol 214: 352-365]. All the subsequent steps in the pathway
occur in membranes. Four desaturation (dehydrogenation) reactions convert
phytoene to lycopene via phytofluene, ζ-carotene, and neurosporene. Each
desaturation increases the number of conjugated double bonds by two such that
the number of conjugated double bonds increases from three in phytoene to eleven
in lycopene.
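
The double-bond count along the desaturation sequence follows directly from the figures above: three conjugated double bonds in phytoene, plus two per desaturation. A minimal sketch, with the hypothetical helper `conjugated_double_bonds` and the ASCII spelling "zeta-carotene" chosen for illustration:

```python
from typing import Dict

# Order of intermediates as given in the text.
INTERMEDIATES = [
    "phytoene",
    "phytofluene",
    "zeta-carotene",
    "neurosporene",
    "lycopene",
]


def conjugated_double_bonds(start: int = 3, step: int = 2) -> Dict[str, int]:
    """Map each intermediate to its number of conjugated double bonds.

    `start` is the count in phytoene and `step` the increase per
    desaturation, both taken from the text.
    """
    return {name: start + step * i for i, name in enumerate(INTERMEDIATES)}


if __name__ == "__main__":
    for name, bonds in conjugated_double_bonds().items():
        print(f"{name}: {bonds} conjugated double bonds")
```

Four steps of two bonds each take the count from three to eleven, consistent with the values stated for phytoene and lycopene.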

Relatively little is known about the molecular mechanism of the enzymatic
dehydrogenation of phytoene [see, Jones BL and Porter JW (1986) Biosynthesis of
carotenes in higher plants. CRC Crit Rev Plant Sci 3: 295-324; and, Beyer P,
Mayer M and Kleinig H (1989) Molecular oxygen and the state of geometric
isomerism of intermediates are essential in the carotene desaturation and
cyclization reactions in daffodil chromoplasts. Eur J Biochem 184: 141-150]. It
has been established that in cyanobacteria, algae and plants the first two
desaturations, from 15-cis-phytoene to ζ-carotene, are catalyzed by a single
membrane-bound enzyme, phytoene desaturase [see, Jones BL and Porter JW
(1986) Biosynthesis of carotenes in higher plants. CRC Crit Rev Plant Sci 3:
295-324; and, Beyer P, Mayer M and Kleinig H (1989) Molecular oxygen and the state
of geometric isomerism of intermediates are essential in the carotene desaturation
and cyclization reactions in daffodil chromoplasts. Eur J Biochem 184: 141-150].
Since the ζ-carotene product is mostly in the all-trans configuration, a cis-trans
isomerization is presumed at this desaturation step. The primary structure of the
phytoene desaturase polypeptide in cyanobacteria is conserved (over 65% identical
residues) with that of algae and plants [see, Pecker I, Chamovitz D, Linden H,
Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the
conversion of phytoene to ζ-carotene is transcriptionally regulated during tomato
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966; Pecker I, Chamovitz D,
Mann V, Sandmann G, Boger P and Hirschberg J (1993) Molecular
characterization of carotenoid biosynthesis in plants: the phytoene desaturase gene
in tomato. In: Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18.
Kluwer, Dordrecht]. Moreover, the same inhibitors block phytoene desaturase in
the two systems [see, Sandmann G and Boger P (1989) Inhibition of carotenoid
biosynthesis by herbicides. In: Boger P and Sandmann G (eds) Target Sites of
Herbicide Action, pp 25-44. CRC Press, Boca Raton, Florida]. Consequently, it is
very likely that the enzymes catalyzing the desaturation of phytoene and
phytofluene in cyanobacteria and plants have similar biochemical and molecular
properties, which are distinct from those of phytoene desaturases in other
microorganisms. One such difference is that phytoene desaturases from
Rhodobacter capsulatus, Erwinia sp. or fungi convert phytoene to neurosporene,
lycopene, or 3,4-dehydrolycopene, respectively.
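
The differences in end product can be restated as a number of desaturation steps per enzyme source. The sketch below is illustrative only: the names `PATHWAY`, `END_PRODUCT` and `desaturation_steps` are hypothetical, and placing 3,4-dehydrolycopene one step beyond lycopene is an assumption of this sketch, since the text names it as the fungal product without giving a step count.

```python
# Intermediates in order; the list index is the number of desaturation steps
# applied to phytoene. The final entry (index 5) is an assumption of this
# sketch: the text names 3,4-dehydrolycopene only as the fungal end product.
PATHWAY = [
    "phytoene",
    "phytofluene",
    "zeta-carotene",
    "neurosporene",
    "lycopene",
    "3,4-dehydrolycopene",
]

# End product of phytoene desaturase by source, as stated in the text.
END_PRODUCT = {
    "cyanobacteria, algae and plants": "zeta-carotene",
    "Rhodobacter capsulatus": "neurosporene",
    "Erwinia sp.": "lycopene",
    "fungi": "3,4-dehydrolycopene",
}


def desaturation_steps(source: str) -> int:
    """Return how many desaturations the enzyme from `source` performs."""
    return PATHWAY.index(END_PRODUCT[source])


if __name__ == "__main__":
    for source in END_PRODUCT:
        print(f"{source}: {desaturation_steps(source)} desaturation steps")
```

Read this way, the cyanobacterial and plant enzyme stops after two steps, whereas the enzymes from the other microorganisms listed carry the desaturation further.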

Desaturation of phytoene in daffodil chromoplasts [see, Beyer P, Mayer M
and Kleinig H (1989) Molecular oxygen and the state of geometric isomerism of
intermediates are essential in the carotene desaturation and cyclization reactions in
daffodil chromoplasts. Eur J Biochem 184: 141-150], as well as in a cell free
system of Synechococcus sp. strain PCC 7942 [see, Sandmann G and Kowalczyk S
(1989) In vitro carotenogenesis and characterization of the phytoene desaturase
reaction in Anacystis. Biochem Biophys Res Com 163: 916-921], is dependent on
molecular oxygen as a possible final electron acceptor, although oxygen is not
directly involved in this reaction. A dehydrogenase-electron transferase
mechanism was supported in cyanobacteria over a dehydrogenase-monooxygenase
mechanism of dehydrogenation [see, Sandmann G and Kowalczyk S (1989) In
vitro carotenogenesis and characterization of the phytoene desaturase reaction in
Anacystis. Biochem Biophys Res Com 163: 916-921]. A conserved FAD-binding
motif exists in all phytoene desaturases whose primary structures have been
analyzed [see, Pecker I, Chamovitz D, Linden H, Sandmann G and Hirschberg J
(1992) A single polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966; Pecker I, Chamovitz D, Mann V, Sandmann G, Boger P and
Hirschberg J (1993) Molecular characterization of carotenoid biosynthesis in
plants: the phytoene desaturase gene in tomato. In: Murata N (ed) Research in
Photosynthesis, Vol III, pp 11-18. Kluwer, Dordrecht]. The phytoene desaturase
enzyme in pepper was shown to contain a protein-bound FAD [see, Hugueney P,
Romer S, Kuntz M and Camara B (1992) Characterization and molecular cloning
of a flavoprotein catalyzing the synthesis of phytofluene and ζ-carotene in
Capsicum chromoplasts. Eur J Biochem 209: 399-407]. Since phytoene desaturase
is located in the membrane, an additional, soluble redox component is predicted.
This hypothetical component could employ NAD(P)+, as suggested [see, Mayer
MP, Nievelstein V and Beyer P (1992) Purification and characterization of a
NADPH dependent oxidoreductase from chromoplasts of Narcissus
pseudonarcissus - a redox-mediator possibly involved in carotene desaturation.
Plant Physiol Biochem 30: 389-398] or another electron and hydrogen carrier, such
as a quinone. The cellular location of phytoene desaturase in Synechocystis sp.
strain PCC 6714 and Anabaena variabilis strain ATCC 29413 was determined
with specific antibodies to be mainly (85%) in the photosynthetic thylakoid
membranes [see, Serrano A, Gimenez P, Schmidt A and Sandmann G (1990)
Immunocytochemical localization and functional determination of phytoene
desaturase in photoautotrophic prokaryotes. J Gen Microbiol 136: 2465-2469].

In cyanobacteria, algae and plants, ζ-carotene is converted to lycopene via
neurosporene. Very little is known about the enzymatic mechanism, which is
predicted to be carried out by a single enzyme [see, Linden H, Vioque A and
Sandmann G (1993) Isolation of a carotenoid biosynthesis gene coding for
ζ-carotene desaturase from Anabaena PCC 7120 by heterologous complementation.
FEMS Microbiol Lett 106: 99-104]. The deduced amino acid sequence of
ζ-carotene desaturase in Anabaena sp. strain PCC 7120 contains a
dinucleotide-binding motif that is similar to the one found in phytoene
desaturase.

Two cyclization reactions convert lycopene to β-carotene. Evidence has
been obtained that in Synechococcus sp. strain PCC 7942 [see, Cunningham FX Jr,
Chamovitz D, Misawa N, Gantt E and Hirschberg J (1993) Cloning and functional
expression in Escherichia coli of a cyanobacterial gene for lycopene cyclase, the
enzyme that catalyzes the biosynthesis of β-carotene. FEBS Lett 328: 130-138], as
well as in plants [see, Camara B and Dogbo O (1986) Demonstration and
solubilization of lycopene cyclase from Capsicum chromoplast membranes. Plant
Physiol 80: 172-184], these two cyclizations are catalyzed by a single enzyme,
lycopene cyclase. This membrane-bound enzyme is inhibited by the triethylamine
compounds, CPTA and MPTA [see, Sandmann G and Boger P (1989) Inhibition of
carotenoid biosynthesis by herbicides. In: Boger P and Sandmann G (eds) Target
Sites of Herbicide Action, pp 25-44. CRC Press, Boca Raton, Florida].
Cyanobacteria carry out only the β-cyclization and therefore do not contain
ε-carotene, δ-carotene and α-carotene and their oxygenated derivatives. The β-ring
is formed through the formation of a "carbonium ion" intermediate when the C-1,2
double bond at the end of the linear lycopene molecule is folded into the position
of the C-5,6 double bond, followed by a loss of a proton from C-6. No cyclic
carotene has been reported in which the 7,8 bond is not a double bond. Therefore,
full desaturation as in lycopene, or desaturation of at least half-molecule as in
neurosporene, is essential for the reaction. Cyclization of lycopene involves a
dehydrogenation reaction that does not require oxygen. The cofactor for this
reaction is unknown. A dinucleotide-binding domain was found in the lycopene
cyclase polypeptide of Synechococcus sp. strain PCC 7942, implicating NAD(P) or
FAD as coenzymes with lycopene cyclase.

The addition of various oxygen-containing side groups, such as hydroxy-,
methoxy-, oxo-, epoxy-, aldehyde or carboxylic acid moieties, forms the various
xanthophyll species. Little is known about the formation of xanthophylls.
Hydroxylation of β-carotene requires molecular oxygen in a mixed-function
oxidase reaction.

Clusters of genes encoding the enzymes for the entire pathway have been
cloned from the purple photosynthetic bacterium Rhodobacter capsulatus [see,
Armstrong GA, Alberti M, Leach F and Hearst JE (1989) Nucleotide sequence,
organization, and nature of the protein products of the carotenoid biosynthesis
gene cluster of Rhodobacter capsulatus. Mol Gen Genet 216: 254-268] and from
the nonphotosynthetic bacteria Erwinia herbicola [see, Sandmann G, Woods WS
and Tuveson RW (1990) Identification of carotenoids in Erwinia herbicola and in
transformed Escherichia coli strain. FEMS Microbiol Lett 71: 77-82; Hundle BS,
Beyer P, Kleinig H, Englert H and Hearst JE (1991) Carotenoids of Erwinia
herbicola and an Escherichia coli HB101 strain carrying the Erwinia herbicola
carotenoid gene cluster. Photochem Photobiol 54: 89-93; and, Schnurr G, Schmidt
A and Sandmann G (1991) Mapping of a carotenogenic gene cluster from Erwinia
herbicola and functional identification of six genes. FEMS Microbiol Lett 78:
157-162] and Erwinia uredovora [see, Misawa N, Nakagawa M, Kobayashi K,
Yamano S, Izawa I, Nakamura K and Harashima K (1990) Elucidation of the
Erwinia uredovora carotenoid biosynthetic pathway by functional analysis of
gene products in Escherichia coli. J Bacteriol 172: 6704-6712]. Two genes, al-3
for GGPP synthase [see, Nelson MA, Morelli G, Carattoli A, Romano N and
Macino G (1989) Molecular cloning of a Neurospora crassa carotenoid
biosynthetic gene (albino-3) regulated by blue light and the products of the white
collar genes. Mol Cell Biol 9: 1271-1276; and, Carattoli A, Romano N, Ballario P,
Morelli G and Macino G (1991) The Neurospora crassa carotenoid biosynthetic
gene (albino 3). J Biol Chem 266: 5854-5859] and al-1 for phytoene desaturase
[see, Schmidhauser TJ, Lauter FR, Russo VEA and Yanofsky C (1990) Cloning
sequencing and photoregulation of al-1, a carotenoid biosynthetic gene of
Neurospora crassa. Mol Cell Biol 10: 5064-5070] have been cloned from the
fungus Neurospora crassa. However, attempts at using these genes as
heterologous molecular probes to clone the corresponding genes from
cyanobacteria or plants were unsuccessful due to lack of sufficient sequence
similarity.

The first "plant-type" genes for carotenoid synthesis enzymes were cloned
from cyanobacteria using a molecular-genetics approach. In the first step towards
cloning the gene for phytoene desaturase, a number of mutants that are resistant to
the phytoene-desaturase-specific inhibitor, norflurazon, were isolated in
Synechococcus sp. strain PCC 7942 [see, Linden H, Sandmann G, Chamovitz D,
Hirschberg J and Boger P (1990) Biochemical characterization of Synechococcus
mutants selected against the bleaching herbicide norflurazon. Pestic Biochem
Physiol 36: 46-51]. The gene conferring norflurazon-resistance was then cloned
by transforming the wild-type strain to herbicide resistance [see, Chamovitz D,
Pecker I and Hirschberg J (1991) The molecular basis of resistance to the herbicide
norflurazon. Plant Mol Biol 16: 967-974; Chamovitz D, Pecker I, Sandmann G,
Boger P and Hirschberg J (1990) Cloning a gene for norflurazon resistance in
cyanobacteria. Z Naturforsch 45c: 482-486]. Several lines of evidence indicated
that the cloned gene, formerly called pds and now named crtP, codes for phytoene
desaturase. The most definitive one was the functional expression of phytoene
desaturase activity in transformed Escherichia coli cells [see, Linden H, Misawa
N, Chamovitz D, Pecker I, Hirschberg J and Sandmann G (1991) Functional
complementation in Escherichia coli of different phytoene desaturase genes and
analysis of accumulated carotenes. Z Naturforsch 46c: 1045-1051; and, Pecker I,
Chamovitz D, Linden H, Sandmann G and Hirschberg J (1992) A single
polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966]. The crtP gene was also cloned from Synechocystis sp. strain
PCC 6803 by similar methods [see, Martinez-Ferez IM and Vioque A (1992)
Nucleotide sequence of the phytoene desaturase gene from Synechocystis sp. PCC






6803 and characterization of a new mutation which confers resistance to the 
herbicide norflurazon. Plant Mol Biol 18: 981-983]. 

The cyanobacterial crtP gene was subsequently used as a molecular probe
for cloning the homologous gene from an alga [see, Pecker I, Chamovitz D, Mann
V, Sandmann G, Boger P and Hirschberg J (1993) Molecular characterization of
carotenoid biosynthesis in plants: the phytoene desaturase gene in tomato. In:
Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18. Kluwer, Dordrecht]
and higher plants [see, Bartley GE, Viitanen PV, Pecker I, Chamovitz D,
Hirschberg J and Scolnik PA (1991) Molecular cloning and expression in
photosynthetic bacteria of a soybean cDNA coding for phytoene desaturase, an
enzyme of the carotenoid biosynthesis pathway. Proc Natl Acad Sci USA 88:
6532-6536; and, Pecker I, Chamovitz D, Linden H, Sandmann G and Hirschberg J
(1992) A single polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966]. The phytoene desaturases in Synechococcus sp. strain PCC 7942
and Synechocystis sp. strain PCC 6803 consist of 474 and 467 amino acid residues,
respectively, whose sequences are highly conserved (74% identities and 86%
similarities). The calculated molecular mass is 51 kDa and, although it is slightly
hydrophobic (hydropathy index -0.2), it does not include a hydrophobic region
which is long enough to span a lipid bilayer membrane. The primary structure of
the cyanobacterial phytoene desaturase is highly conserved with the enzyme from
the green alga Dunaliella bardawil (61% identical and 81% similar; [see, Pecker I,
Chamovitz D, Mann V, Sandmann G, Boger P and Hirschberg J (1993) Molecular
characterization of carotenoid biosynthesis in plants: the phytoene desaturase gene
in tomato. In: Murata N (ed) Research in Photosynthesis, Vol III, pp 11-18.
Kluwer, Dordrecht]) and from tomato [see, Pecker I, Chamovitz D, Linden H,
Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the
conversion of phytoene to ζ-carotene is transcriptionally regulated during tomato
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966], pepper [see, Hugueney P,
Romer S, Kuntz M and Camara B (1992) Characterization and molecular cloning
of a flavoprotein catalyzing the synthesis of phytofluene and ζ-carotene in
Capsicum chromoplasts. Eur J Biochem 209: 399-407] and soybean [see, Bartley
GE, Viitanen PV, Pecker I, Chamovitz D, Hirschberg J and Scolnik PA (1991)
Molecular cloning and expression in photosynthetic bacteria of a soybean cDNA
coding for phytoene desaturase, an enzyme of the carotenoid biosynthesis
pathway. Proc Natl Acad Sci USA 88: 6532-6536] (62-65% identical and ~79%
similar; [see, Chamovitz D (1993) Molecular analysis of the early steps of
carotenoid biosynthesis in cyanobacteria: Phytoene synthase and phytoene
desaturase. Ph.D. Thesis, The Hebrew University of Jerusalem]). The eukaryotic
phytoene desaturase polypeptides are larger (64 kDa); however, they are processed
during import into the plastids to mature forms whose sizes are comparable to
those of the cyanobacterial enzymes.
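
As a rough cross-check on the polypeptide sizes quoted above, the following Python sketch estimates mass from residue count. It assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from this document, and the function name is hypothetical. The estimate should only be expected to land within a few kDa of a mass calculated from the actual sequence.

```python
# Rule-of-thumb estimate: an average amino acid residue contributes ~110 Da.
# AVG_RESIDUE_MASS_DA is an assumption, not a value from the specification.
AVG_RESIDUE_MASS_DA = 110.0


def estimate_mass_kda(n_residues: int) -> float:
    """Approximate polypeptide mass in kDa from its residue count."""
    if n_residues <= 0:
        raise ValueError("residue count must be positive")
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


# Residue counts quoted in the text for the two cyanobacterial phytoene desaturases.
for strain, residues in [("Synechococcus PCC 7942", 474),
                         ("Synechocystis PCC 6803", 467)]:
    print(f"{strain}: {residues} residues, about {estimate_mass_kda(residues):.1f} kDa")
```

Both estimates fall close to the 51 kDa calculated mass stated in the text, which is all this kind of approximation can show.
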

There is a high degree of structural similarity in carotenoid enzymes of
Rhodobacter capsulatus, Erwinia sp. and Neurospora crassa [reviewed in
Armstrong GA, Hundle BS and Hearst JE (1993) Evolutionary conservation and
structural similarities of carotenoid biosynthesis gene products from
photosynthetic and nonphotosynthetic organisms. Meth Enzymol 214: 297-311],
including in the crtI gene-product, phytoene desaturase. As indicated above, a
high degree of conservation of the primary structure of phytoene desaturases also
exists among oxygenic photosynthetic organisms. However, there is little
sequence similarity, except for the FAD binding sequences at the amino termini,
between the "plant-type" crtP gene products and the "bacterial-type" phytoene
desaturases (crtI gene products; 19-23% identities and 42-47% similarities). It has
been hypothesized that crtP and crtI are not derived from the same ancestral gene
and that they originated independently through convergent evolution [see, Pecker
I, Chamovitz D, Linden H, Sandmann G and Hirschberg J (1992) A single
polypeptide catalyzing the conversion of phytoene to ζ-carotene is
transcriptionally regulated during tomato fruit ripening. Proc Natl Acad Sci USA
89: 4962-4966]. This hypothesis is supported by the different dehydrogenation
sequences that are catalyzed by the two types of enzymes and by their different
sensitivities to inhibitors.

Although not as definite as in the case of phytoene desaturase, a similar
distinction between cyanobacteria and plants on the one hand and other
microorganisms on the other is also seen in the structure of phytoene synthase.
The crtB gene (formerly psy) encoding phytoene synthase was identified in the
genome of Synechococcus sp. strain PCC 7942 adjacent to crtP and within the same
operon [see, Bartley GE, Viitanen PV, Pecker I, Chamovitz D, Hirschberg J and
Scolnik PA (1991) Molecular cloning and expression in photosynthetic bacteria of a
soybean cDNA coding for phytoene desaturase, an enzyme of the carotenoid
biosynthesis pathway. Proc Natl Acad Sci USA 88: 6532-6536]. This gene
encodes a 36-kDa polypeptide of 307 amino acids with a hydropathy index of
-0.4. The deduced amino acid sequence of the cyanobacterial phytoene synthase is
highly conserved with the tomato phytoene synthase (57% identical and 70%
similar; [see, Ray JA, Bird CR, Maunders M, Grierson D and Schuch W (1987)
Sequence of pTOM5, a ripening related cDNA from tomato. Nucl Acids Res 15:
10587-10588]) but is less highly conserved with the crtB sequences from other
bacteria (29-32% identical and 48-50% similar with ten gaps in the alignment).
Both types of enzymes contain two conserved sequence motifs also found in
prenyl transferases from diverse organisms [see, Bartley GE, Viitanen PV, Pecker
I, Chamovitz D, Hirschberg J and Scolnik PA (1991) Molecular cloning and
expression in photosynthetic bacteria of a soybean cDNA coding for phytoene
desaturase, an enzyme of the carotenoid biosynthesis pathway. Proc Natl Acad Sci
USA 88: 6532-6536; Carattoli A, Romano N, Ballario P, Morelli G and Macino G
(1991) The Neurospora crassa carotenoid biosynthetic gene (albino 3). J Biol
Chem 266: 5854-5859; Armstrong GA, Hundle BS and Hearst JE (1993)
Evolutionary conservation and structural similarities of carotenoid biosynthesis
gene products from photosynthetic and nonphotosynthetic organisms. Meth
Enzymol 214: 297-311; Math SK, Hearst JE and Poulter CD (1992) The crtE gene
in Erwinia herbicola encodes geranylgeranyl diphosphate synthase. Proc Natl
Acad Sci USA 89: 6761-6764; and, Chamovitz D (1993) Molecular analysis of the
early steps of carotenoid biosynthesis in cyanobacteria: Phytoene synthase and
phytoene desaturase. Ph.D. Thesis, The Hebrew University of Jerusalem]. It is
conceivable that these regions in the polypeptide are involved in the binding and/or
removal of the pyrophosphate during the condensation of two GGPP molecules.

The crtQ gene encoding ζ-carotene desaturase (formerly zds) was cloned
from Anabaena sp. strain PCC 7120 by screening an expression library of
cyanobacterial genomic DNA in cells of Escherichia coli carrying the Erwinia sp.
crtB and crtE genes and the cyanobacterial crtP gene [see, Linden H, Vioque A
and Sandmann G (1993) Isolation of a carotenoid biosynthesis gene coding for
ζ-carotene desaturase from Anabaena PCC 7120 by heterologous complementation.
FEMS Microbiol Lett 106: 99-104]. Since these Escherichia coli cells produce
ζ-carotene, brownish-red pigmented colonies that produced lycopene could be
identified on the yellowish background of cells producing ζ-carotene. The
predicted ζ-carotene desaturase from Anabaena sp. strain PCC 7120 is a 56-kDa
polypeptide which consists of 499 amino acid residues. Surprisingly, its primary
structure is not conserved with the "plant-type" (crtP gene product) phytoene
desaturases, but it has considerable sequence similarity to the bacterial-type
enzyme (crtI gene product) [see, Sandmann G (1993) Genes and enzymes
involved in the desaturation reactions from phytoene to lycopene. (abstract), 10th
International Symposium on Carotenoids, Trondheim CLI-2]. It is possible that
the cyanobacterial crtQ gene and crtI gene of other microorganisms originated in
evolution from a common ancestor.

The crtL gene for lycopene cyclase (formerly lcy) was cloned from
Synechococcus sp. strain PCC 7942 utilizing essentially the same cloning strategy
as for crtP. By using an inhibitor of lycopene cyclase, 2-(4-methylphenoxy)-
triethylamine hydrochloride (MPTA), the gene was isolated by transformation of
the wild-type to herbicide-resistance [see, Cunningham FX Jr, Chamovitz D,
Misawa N, Gantt E and Hirschberg J (1993) Cloning and functional expression in
Escherichia coli of a cyanobacterial gene for lycopene cyclase, the enzyme that
catalyzes the biosynthesis of β-carotene. FEBS Lett 328: 130-138]. Lycopene
cyclase is the product of a single gene and catalyzes the double cyclization
reaction of lycopene to β-carotene. The crtL gene product in Synechococcus sp.
strain PCC 7942 is a 46-kDa polypeptide of 411 amino acid residues. It has no
sequence similarity to the crtY gene product (lycopene cyclase) from Erwinia
uredovora or Erwinia herbicola.
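
The gene assignments described in the preceding paragraphs for cyanobacteria can be collected into a small lookup table. The Python sketch below is only an illustrative summary: the data structure and the function name are hypothetical, and the steps listed are limited to those stated in the text (crtB, crtP, crtQ and crtL).

```python
# Each entry: gene -> (enzyme, substrate, product), as described in the text
# for cyanobacteria. The table layout itself is an illustrative assumption.
CYANOBACTERIAL_STEPS = {
    "crtB": ("phytoene synthase", "GGPP", "phytoene"),
    "crtP": ("phytoene desaturase", "phytoene", "zeta-carotene"),
    "crtQ": ("zeta-carotene desaturase", "zeta-carotene", "lycopene"),
    "crtL": ("lycopene cyclase", "lycopene", "beta-carotene"),
}


def end_product(genes, start="GGPP"):
    """Follow the steps whose genes are present; return the last compound reached."""
    by_substrate = {
        substrate: (gene, product)
        for gene, (_enzyme, substrate, product) in CYANOBACTERIAL_STEPS.items()
    }
    compound = start
    # Advance only while the gene for the next step is available.
    while compound in by_substrate and by_substrate[compound][0] in genes:
        compound = by_substrate[compound][1]
    return compound


print(end_product({"crtB", "crtP"}))
print(end_product({"crtB", "crtP", "crtQ", "crtL"}))
```

The first call mirrors the screening background described for crtQ, in which cells that stop at ζ-carotene are distinguished from those that go on to lycopene once the next gene is supplied.
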

The genes for β-carotene hydroxylase (crtZ) and zeaxanthin glycosylase
(crtX) have been cloned from Erwinia herbicola [see, Hundle B, Alberti M,
Nievelstein V, Beyer P, Kleinig H, Armstrong GA, Burke DH and Hearst JE
(1994) Functional assignment of Erwinia herbicola Eho10 carotenoid genes
expressed in Escherichia coli. Mol Gen Genet 254: 406-416; Hundle BS, Obrien
DA, Alberti M, Beyer P and Hearst JE (1992) Functional expression of zeaxanthin
glucosyltransferase from Erwinia herbicola and a proposed diphosphate binding
site. Proc Natl Acad Sci USA 89: 9321-9325] and from Erwinia uredovora [see,
Misawa N, Nakagawa M, Kobayashi K, Yamano S, Izawa I, Nakamura K and
Harashima K (1990) Elucidation of the Erwinia uredovora carotenoid biosynthetic
pathway by functional analysis of gene products in Escherichia coli. J Bacteriol
172: 6704-6712].

The ketocarotenoid astaxanthin (3,3'-dihydroxy-β,β-carotene-4,4'-dione)
was first described in aquatic crustaceans as an oxidized form of β-carotene.
Astaxanthin was later found to be very common in many marine animals and
algae. However, only a few animals can synthesize astaxanthin de novo from other
carotenoids and most of them obtain it in their food. In the plant kingdom,
astaxanthin occurs mainly in some species of cyanobacteria, algae and lichens.
However, it is also found, though rarely, in petals of higher plant species [see,
Goodwin TW (1980) The Biochemistry of the carotenoids, Vol. 1. 2nd Ed, Chapman
and Hall, London and New York].

The function of astaxanthin as a powerful antioxidant in animals has been
demonstrated [see, Miki W (1991) Biological functions and activities of animal
carotenoids. Pure Appl Chem 63: 141]. Astaxanthin is a strong inhibitor of lipid
peroxidation and has been shown to play an active role in the protection of
biological membranes from oxidative injury [see, Palozza P and Krinsky NI (1992)
Antioxidant effects of carotenoids in vivo and in vitro - an overview. Methods
Enzymol 213: 403-420; and, Kurashige M, Okimasu E, Inove M and Utsumi K
(1990) Inhibition of oxidative injury of biological membranes by astaxanthin.
Physiol Chem Phys Med NMR 22: 27]. The chemopreventive effects of
astaxanthin have also been investigated, and astaxanthin was shown to
significantly reduce the incidence of induced urinary bladder cancer in mice [see,
Tanaka T, Morishita Y, Suzui M, Kojima T, Okumura A and Mori H (1994)
Chemoprevention of mouse urinary bladder carcinogenesis by the naturally
occurring carotenoid astaxanthin. Carcinogenesis 15: 15]. It has also been
demonstrated that astaxanthin exerts immunomodulating effects by enhancing
antibody production [see, Jyonouchi H, Zhang L and Tomita Y (1993) Studies of
immunomodulating actions of carotenoids. II. Astaxanthin enhances in vitro
antibody production to T-dependent antigens without facilitating polyclonal B-cell
activation. Nutr Cancer 19: 269; and, Jyonouchi H, Hill JR, Yoshifumi T and
Good RA (1991) Studies of immunomodulating actions of carotenoids. I. Effects
of β-carotene and astaxanthin on murine lymphocyte functions and cell surface
marker expression in-vitro culture system. Nutr Cancer 16: 93]. The complete
biomedical properties of astaxanthin remain to be elucidated, but initial results
suggest that it could play an important role in cancer and tumor prevention, as
well as eliciting a positive response from the immune system.

Astaxanthin is the principal carotenoid pigment of salmonids and shrimps
and imparts attractive pigmentation in the eggs, flesh and skin [see, Torrisen OJ,
Hardy RW, Shearer KD (1989) Pigmentation of salmonid-carotenoid deposition
and metabolism in salmonids. Crit Rev Aquatic Sci 1: 209]. The world-wide
harvest of salmon in 1991 was approximately 720,000 MT, of which 25-30%
was produced in a variety of aquaculture facilities [see, Meyers SP (1994)
Developments in world aquaculture, feed formulations, and role of carotenoids.
Pure Appl Chem 66: 1069]. This is set to increase up to 460,000 MT by the year
2000 [see, Bjorndahl T (1990) The Economics of Salmon Aquaculture. Blackwell
Scientific, Oxford, pp. 1]. The red coloration of the salmonid flesh contributes to
consumer appeal and therefore affects the price of the final product. Animals
cannot synthesize carotenoids and they acquire the pigments through the food
chain from the primary producers - marine algae and phytoplankton. Those grown
in intensive culture usually suffer from suboptimal color. Consequently,
carotenoid-containing nourishment is artificially added in aquaculture, at
considerable cost to the producer.
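
The harvest figures above imply a range for 1991 aquaculture output that the text does not spell out. The Python sketch below works through that arithmetic. It assumes that the 460,000 MT projection refers to aquaculture production rather than to the total harvest, which is an interpretation of the text and not something it states outright; the function name is hypothetical.

```python
def aquaculture_range(total_mt, share_low, share_high):
    """Return the (low, high) aquaculture tonnage implied by a share range."""
    return total_mt * share_low, total_mt * share_high


# Figures quoted in the text: 720,000 MT total in 1991, 25-30% from aquaculture.
low, high = aquaculture_range(720_000, 0.25, 0.30)
print(f"1991 aquaculture output: {low:,.0f} to {high:,.0f} MT")

# Assumed reading: the 460,000 MT projected for 2000 is aquaculture output.
projected = 460_000
print(f"Implied growth by 2000: {projected / high:.1f}x to {projected / low:.1f}x")
```

Under that reading, aquaculture output would roughly double or more over the decade.
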

Astaxanthin is the most expensive commercially used carotenoid compound
(the 1995 market value is $2,500-3,500/kg). It is utilized mainly as a
nutritional supplement which provides pigmentation in a wide variety of aquatic
animals. In the Far-East it is used also for feeding poultry to yield a typical
pigmentation of chickens. It is also a desirable and effective nontoxic coloring for
the food industry and is valuable in cosmetics. Recently it was reported that
astaxanthin is a potent antioxidant in humans and thus is a desirable food additive.

Natural (3S,3'S) astaxanthin is limited in availability. It is commercially
extracted from some Crustacea species [see, Torrisen OJ, Hardy RW, Shearer KD
(1989) Pigmentation of salmonid-carotenoid deposition and metabolism in
salmonids. Crit Rev Aquatic Sci 1: 209]. The (3R,3'R) stereoisomer of
astaxanthin is produced from Phaffia [a yeast species, see, Andrewes AG, Phaff HJ
and Starr MP (1976) Carotenoids of Phaffia rhodozyma, a red-pigmented
fermenting yeast. Phytochemistry Vol. 15, pp. 1003-1007]. Synthetic astaxanthin,
comprising a 1:2:1 mixture of the (3S,3'S)-, (3S,3'R)- and (3R,3'R)-isomers, is now
manufactured by Hoffman-La Roche and sold at a high price (ca. $2,500/kg)
under the name "CAROPHYLL Pink" [see, Mayer H (1994) Reflections on
carotenoid synthesis. Pure & Appl Chem, Vol. 66, pp. 931-938]. Recently a novel
gene involved in ketocompound biosynthesis, designated crtW, was isolated from
the marine bacteria Agrobacterium aurantiacum and Alcaligenes PC-1 that
produce ketocarotenoids such as astaxanthin. When the crtW gene was introduced
into engineered Escherichia coli that accumulated β-carotene due to Erwinia
carotenogenic genes, the Escherichia coli transformants synthesized canthaxanthin,
a precursor in the synthetic pathway of astaxanthin [see, Misawa N, Kajiwara S,
Kondo K, Yokoyama A, Satomi Y, Saito T, Miki W and Ohtani T (1995)
Canthaxanthin biosynthesis by the conversion of methylene to keto groups in a
hydrocarbon β-carotene by a single gene. Biochemical and biophysical research
communications Vol. 209, pp. 867-876]. It is therefore desirable to find a
relatively inexpensive source of (3S,3'S) astaxanthin to be used as a feed
supplement in aquaculture and as a valuable chemical for various other industrial
uses.
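
The 1:2:1 ratio quoted for synthetic astaxanthin is what one would expect if each of the two chiral centers (C-3 and C-3') forms as R or S with equal probability and independently of the other, with the (3R,3'S) and (3S,3'R) forms counted as one meso compound because the molecule is symmetric. The Python sketch below enumerates that case. The equal-probability assumption is an illustration only and is not a statement from the text about the manufacturing process; the function name is hypothetical.

```python
from collections import Counter
from itertools import product


def isomer_ratio():
    """Count distinct stereoisomers for two equivalent centers, each R or S."""
    counts = Counter()
    for c3, c3_prime in product("RS", repeat=2):
        # The molecule is symmetric, so (R,S) and (S,R) are the same meso form.
        key = "".join(sorted((c3, c3_prime)))
        counts[key] += 1
    return counts


ratio = isomer_ratio()
print(ratio["SS"], ratio["RS"], ratio["RR"])  # 1 2 1
```
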

Although astaxanthin is synthesized in a variety of bacteria, fungi and algae,
the key limitation to the use of biological systems for its production is the low
yield and the costly extraction methods in these systems compared to chemical
synthesis. One way to solve these problems is to increase the productivity of
astaxanthin production in biological systems using recombinant DNA technology.
This allows for the production of astaxanthin in a genetically engineered host which,
in the case of a higher plant, is easy to grow and simple to extract. Furthermore,
with appropriate host selection, astaxanthin produced in a genetically engineered
host can be used directly, for example in aquaculture applications, without the
need for extraction.

There is thus a widely recognized need for, and it would be highly
advantageous to have, a nucleic acid segment which encodes β-C-4-oxygenase, the
enzyme that converts β-carotene to canthaxanthin, as well as recombinant vector
molecules comprising a nucleic acid sequence according to the invention, and host
cells or transgenic organisms transformed or transfected with these vector
molecules or DNA segment for the biotechnological production of (3S,3'S)
astaxanthin.

Other features and advantages of the invention will be apparent from the 
following description and from the claims. 

SUMMARY OF THE INVENTION

It is a general object of this invention to provide a biotechnological method
for production of (3S,3'S) astaxanthin.

It is a specific object of the invention to provide a peptide having a
β-C-4-oxygenase activity and a DNA segment coding for this peptide to enable a
biotechnological production of astaxanthin and other xanthophylls.

It is a further object of the invention to provide an RNA segment coding
for a polypeptide comprising an amino acid sequence corresponding to the above
described peptide.

It is yet a further object of the invention to provide a recombinant DNA 
molecule comprising a vector and the DNA segment as described above. 

It is still a further object of the invention to provide a host cell containing 
the above described recombinant DNA molecule. 

It is another object of the invention to provide a host transgenic organism 
containing the above described recombinant DNA molecule or the above described 
DNA segment in its cells. 

It is still another object of the invention to provide a host transgenic
organism which expresses β-C-4-oxygenase activity in chloroplasts and/or
chromoplasts-containing tissues.

It is yet another object of the invention to provide a food additive for animal 
or human consumption comprising the above described host cell or transgenic 
organism. 

It is still another object of the invention to provide a method of producing 
astaxanthin using the above described host cell or transgenic organism. 

It is a further object of the invention to provide a method of producing
canthaxanthin, echinenone, cryptoxanthin, isocryptoxanthin, hydroxyechinenone,
zeaxanthin, adonirubin, and/or adonixanthin using the above described host cell or
transgenic organism.

Further objects and advantages of the present invention will be clear from
the description that follows.

In one embodiment, the present invention relates to a DNA segment coding
for a polypeptide comprising an amino acid sequence corresponding to
Haematococcus pluvialis crtO gene.

In a further embodiment, the present invention relates to an RNA segment 
coding for a polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene. 

In yet another embodiment, the present invention relates to a polypeptide
comprising an amino acid sequence corresponding to a Haematococcus pluvialis
crtO gene.

In a further embodiment, the present invention relates to a recombinant
DNA molecule comprising a vector and a DNA segment coding for a polypeptide,
corresponding to a Haematococcus pluvialis crtO gene.

In another embodiment, the present invention relates to a host cell
containing the above described recombinant DNA molecule or DNA segment.

In a further embodiment, the present invention relates to a host transgenic
organism containing the above described recombinant DNA molecule or the above
described DNA segment in its cells.

In another embodiment, the present invention relates to a method of
producing astaxanthin using the above described host cell or transgenic organism.

In yet another embodiment, the present invention relates to a method of 
producing other xanthophylls. 

In still another embodiment, the present invention relates to a method of
obtaining high expression of a transgene in plants specifically in
chromoplasts-containing cells.

In one further embodiment, the present invention relates to a method of
importing a carotenoid-biosynthesis enzyme encoded by a transgene into
chromoplasts.

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with reference to
the accompanying drawings, wherein:

FIG. 1 is a general biochemical pathway of β-carotene biosynthesis, in
which pathway all molecules are depicted in an all-trans configuration, wherein
IPP is isopentenyl pyrophosphate, DMAPP is dimethylallyl pyrophosphate, GPP is
geranyl pyrophosphate, FPP is farnesyl pyrophosphate, GGPP is geranylgeranyl
pyrophosphate and, PPPP is prephytoene pyrophosphate;

FIG. 2 is an identity map between the nucleotide sequence of the crtO
cDNA of the present invention (CRTOA.SEQ) and the cDNA cloned by Kajiwara
et al., (CRTOJ.SEQ) [see, Kajiwara S, Kakizono T, Saito T, Kondo K, Ohtani T,
Nishio N, Nagai S and Misawa N (1995) Isolation and functional identification of
a novel cDNA for astaxanthin biosynthesis from Haematococcus pluvialis, and
astaxanthin synthesis in Escherichia coli. Plant Molec Biol 29: 343-352], using a
GCG software, wherein (:) indicate identity, (-) indicate a gap and nucleotides
numbering is according to SEQ ID NO:4 for CRTOA.AMI and Kajiwara et al., for
CRTOJ.AMI;

FIG. 3 is an identity map between the amino acid sequence encoded by the
crtO cDNA of the present invention (CRTOA.AMI) and the amino acid sequence
encoded by the cDNA cloned by Kajiwara et al., (CRTOJ.AMI) [see, Kajiwara S,
Kakizono T, Saito T, Kondo K, Ohtani T, Nishio N, Nagai S and Misawa N (1995)
Isolation and functional identification of a novel cDNA for astaxanthin
biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in
Escherichia coli. Plant Molec Biol 29: 343-352], using a GCG software, wherein
(:) indicate identity, (-) indicate a gap and amino acids numbering is according to
SEQ ID NO:4 for CRTOA.AMI and Kajiwara et al., for CRTOJ.AMI;

FIG. 4 is a schematic depiction of a pACYC184 derived plasmid designated
pBCAR, which includes the genes crtE, crtB, crtI and crtY of Erwinia herbicola,
which genes are required for production of β-carotene in Escherichia coli cells;

FIG. 5 is a schematic depiction of a pACYC184 derived plasmid designated
pZEAX, which includes the genes crtE, crtB, crtI, crtY and crtZ from Erwinia
herbicola, which genes are required for production of zeaxanthin in Escherichia
coli cells;

FIG. 6 is a schematic depiction of a pBluescript SK- derived plasmid
designated pHPK, containing a full length cDNA insert encoding a β-carotene
C-4-oxygenase enzyme from Haematococcus pluvialis, designated crtO and set forth
in SEQ ID NO:1, which cDNA was identified by color complementation of
Escherichia coli cells;

FIG. 7 is a schematic depiction of a pACYC184 derived plasmid designated
pCANTHA which was derived by inserting a 1.2 kb PstI-PstI DNA fragment,
containing the cDNA encoding the β-C-4-oxygenase from Haematococcus
pluvialis isolated from the plasmid pHPK of Figure 6 and inserted into a PstI site
in the coding sequence of the crtZ gene in the plasmid pZEAX of Figure 5; this
recombinant plasmid carries the genes crtE, crtB, crtI, crtY of Erwinia herbicola
and the crtO gene of Haematococcus pluvialis, all required for production of
canthaxanthin in Escherichia coli cells;

FIG. 8 is a schematic depiction of a pACYC184 derived plasmid designated
pASTA which was derived by inserting the 1.2 kb PstI-PstI DNA fragment,
containing the cDNA of the β-C-4-oxygenase from Haematococcus pluvialis
isolated from the plasmid pHPK of Figure 6 and inserted into a PstI site which
exists 600 bp downstream of the crtE gene in the plasmid pZEAX of Figure 5; this
recombinant plasmid carries the genes crtE, crtB, crtI, crtY, crtZ of Erwinia
herbicola and the crtO gene of Haematococcus pluvialis, all required for
production of astaxanthin in Escherichia coli cells;

FIG. 9 is a schematic depiction of a pBR328 derived plasmid designated
PAN3.5-KETO which was derived by inserting the 1.2 kb PstI-PstI DNA
fragment, containing the cDNA of the β-C-4-oxygenase from Haematococcus
pluvialis isolated from the plasmid pHPK of Figure 6 and inserted into a PstI site
which exists in a β-lactamase gene in a plasmid designated pPAN35D5 [described
in Hirschberg J, Ohad N, Pecker I and Rahat A (1987) Isolation and
characterization of herbicide resistant mutants in the cyanobacterium
Synechococcus R2. Z. Naturforsch 42c: 102-112], which carries the psbAI gene
from the cyanobacterium Synechococcus PCC7942 in the plasmid vector pBR328
[see, Hirschberg J, Ohad N, Pecker I and Rahat A (1987) Isolation and
characterization of herbicide resistant mutants in the cyanobacterium
Synechococcus R2. Z. Naturforsch 42c: 102-112]; this recombinant plasmid carries
the crtO gene of Haematococcus pluvialis, required for production of astaxanthin
in Synechococcus PCC7942 cells;

FIG. 10 is a schematic depiction of the T-DNA region of a Ti binary
plasmid (E. coli, Agrobacterium) designated pBIB [described by Becker D
(1990) Binary vectors which allow the exchange of plant selectable markers
and reporter genes. Nucleic Acids Research 18:230] which is a derivative of the
Ti plasmid pBI101 [described by Jefferson AR, Kavanagh TA and Bevan WM
(1987) GUS fusions: β-glucuronidase as a sensitive and versatile gene fusion
marker in higher plants. The EMBO J. 6: 3901-3907], wherein Br and Bl are
the right and left borders, respectively, of the T-DNA region, pAg7 is the
polyadenylation site of gene 7 of Agrobacterium Ti-plasmid, pAnos is a 250 bp
long DNA fragment containing the polyadenylation site of the nopaline
synthase gene of Agrobacterium, NPT II is a 1,800 bp long DNA fragment
coding for kanamycin resistance, pnos is a 300 bp long DNA fragment
containing the promoter sequence of the nopaline synthase gene of
Agrobacterium, whereas pAnos is a 300 bp long DNA fragment containing the
polyadenylation site of the nopaline synthase gene of Agrobacterium;

FIG. 11 is a schematic depiction of the T-DNA region of a Ti binary
plasmid (E. coli, Agrobacterium) designated pPTBIB which was prepared by
cloning a genomic DNA sequence of a tomato species Lycopersicon
esculentum marked PT (nucleotides 1 to 1448 of the Pds gene as published in
Mann V, Pecker I and Hirschberg J (1994) cloning and characterization of the
gene for phytoene desaturase (Pds) from tomato (Lycopersicon esculentum).
Plant Molecular Biology 24: 429-434), which contains the promoter of the Pds
gene and the coding sequence for the amino terminus region of the polypeptide
PDS that serves as a transit peptide for import into chloroplasts and
chromoplasts, into a HindIII-SmaI site of the binary plasmid vector pBIB of
Figure 10, wherein Br and Bl, pAg7, pAnos, NPT II, pnos and pAnos are as
defined above;

FIG. 12 is a schematic depiction of the T-DNA region of a Ti binary
plasmid (E. coli, Agrobacterium) designated pPTCRTOBIB which was
prepared by cloning a 1,110 nucleotide long Eco47III-NcoI fragment of the
cDNA of crtO from H. pluvialis (nucleotides 211 to 1321 of SEQ ID NO:1)
into the SmaI site of the plasmid pPTBIB of Figure 11, such that the coding
nucleotide sequence of the amino terminus of PDS is in the same reading frame
as crtO, wherein Br and Bl, pAg7, pAnos, NPT II, pnos, and pAnos are as
defined above, PT is the promoter and transit peptide coding sequences of Pds
from tomato and CRTO is the nucleotide sequence of crtO from H. pluvialis
(nucleotides 211 to 1321 of SEQ ID NO:1);

FIG. 13 shows a Southern DNA blot analysis of HindIII-digested genomic
DNA extracted from wild type (WT) and crtO tobacco transgenic plants,
designated 2, 3, 4, 6, 9 and 10, according to the present invention, using the crtO
cDNA as a radioactive probe essentially as described in Sambrook et al.,
Molecular Cloning; A Laboratory Manual. Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y. 1989, wherein the size of marker (M) DNA fragments in
kilobase pairs (kb) is indicated on the left as well as the expected position (arrow)
of an internal T-DNA HindIII fragment as was deduced from the sequence of
pPTPDSBIB shown in Figure 12 which contains the crtO cDNA sequence;

FIG. 14 shows a biosynthesis pathway of astaxanthin;

FIG. 15 shows a flower from a wild type tobacco plant and a flower from a
transgenic tobacco plant according to the present invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention is, in general, of a biotechnological method for
production of (3S,3'S) astaxanthin. In particular, the present invention is of a
peptide having a β-C-4-oxygenase activity; a DNA segment coding for this
peptide; an RNA segment coding for this peptide; a recombinant DNA molecule
comprising a vector and the DNA segment; a host cell or organism containing the
above described recombinant DNA molecule or DNA segment; and of a method
for biotechnologically producing (3S,3'S) astaxanthin or a food additive containing
(3S,3'S) astaxanthin, using the host.

The unicellular fresh-water green alga Haematococcus pluvialis
accumulates large amounts of (3S,3'S) astaxanthin when exposed to unfavorable
growth conditions, or following different environmental stresses such as phosphate
or nitrogen starvation, high concentration of salt in the growth medium or high
light intensity [see, Yong YYR and Lee YK (1991) Phycologia 30: 257-261; Droop
MR (1954) Arch Microbiol 20: 391-397; and, Andrewes AG, Borch G,
Liaaen-Jensen S and Snatzke G (1974) Acta Chem Scand B28: 730-736]. During this
process, the vegetative cells of the alga form cysts and change their color from
green to red. The present invention discloses the cloning of a cDNA from
Haematococcus pluvialis, designated crtO, which encodes a β-C-4-oxygenase, the
enzyme that converts β-carotene to canthaxanthin, and its expression in
heterologous systems expressing β-carotene hydroxylase (e.g., Erwinia herbicola
crtZ gene product), leading to the production of (3S,3'S) astaxanthin.

The crtO cDNA and its encoded peptide having a β-C-4-oxygenase activity
are novel nucleic and amino acid sequences, respectively. The cloning method of
the crtO cDNA took advantage of a strain of Escherichia coli, which was
genetically engineered to produce β-carotene, to which a cDNA library of
Haematococcus pluvialis was transfected and expressed. Visual screening for
brown-red pigmented Escherichia coli cells has identified a canthaxanthin
producing transformant. The thus cloned cDNA has been expressed in two
heterologous systems (Escherichia coli and Synechococcus PCC7942 cells), both
able to produce β-carotene and further including an engineered (Erwinia herbicola
crtZ gene product) or endogenous β-carotene hydroxylase activity, and was shown
to enable the production of (3S,3'S) astaxanthin in both these systems.

The crtO cDNA or its protein product exhibit no meaningful nucleic- or
amino acid sequence similarities to the nucleic- or amino acid sequence of crtW
and its protein product isolated from the marine bacteria Agrobacterium
aurantiacum and Alcaligenes PC-1 that produce ketocarotenoids such as
astaxanthin [see, Misawa N, Kajiwara S, Kondo K, Yokoyama A, Satomi Y, Saito
T, Miki W and Ohtani T (1995) Canthaxanthin biosynthesis by the conversion of
methylene to keto groups in a hydrocarbon β-carotene by a single gene.
Biochemical and biophysical research communications Vol. 209, pp. 867-876].

However, the crtO cDNA and its protein product exhibit substantial
nucleic- and amino acid sequence identities with the nucleic- and amino acid
sequence of a recently cloned cDNA encoding a 320 amino acids protein product
having β-carotene oxygenase activity, isolated from Haematococcus pluvialis [see,
Kajiwara S, Kakizono T, Saito T, Kondo K, Ohtani T, Nishio N, Nagai S and
Misawa N (1995) Isolation and functional identification of a novel cDNA for
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis
in Escherichia coli. Plant Molec Biol 29: 343-352]. Nevertheless, as presented in
Figure 2, the degree of sequence identity between the crtO cDNA (CRTOA.SEQ in
Figure 2) and the cDNA described by Kajiwara et al. (CRTOJ.SEQ in Figure 2)
[see reference above] is 75.7% and, as presented in Figure 3, the degree of
sequence identity between the crtO cDNA protein product (CRTOA.AMI in
Figure 3) and the protein described by Kajiwara et al. (CRTOJ.AMI in Figure 3) is
78%, as was determined using GCG software.

As will be described in detail hereinbelow, the crtO cDNA can thus be
employed to biotechnologically produce (3S,3'S) astaxanthin in systems which are
either easy to grow and can be used directly as an additive to fish food, or systems
permitting a simple and low cost extraction procedure of astaxanthin.

In one embodiment, the present invention relates to a DNA segment coding
for a polypeptide comprising an amino acid sequence corresponding to
Haematococcus pluvialis crtO gene and allelic and species variations and
functional naturally occurring and/or man-induced variants thereof. The phrase
'allelic and species variations and functional naturally occurring and/or
man-induced variants' as used herein and in the claims below refers to the source of the
DNA (or RNA as described below) or means known in the art for obtaining it.
However, the terms 'variation' and 'variants' indicate the presence of sequence
dissimilarities (i.e., variations). It is the intention herein and in the claims below
that the sequence variations will be 77-80%, preferably 80-85%, more preferably
85-90%, most preferably 90-100% of identical nucleotides. In a preferred
embodiment the DNA segment comprises the sequence set forth in SEQ ID NO:1.
In another preferred embodiment, the DNA segment encodes the amino acid
sequence set forth in SEQ ID NO:4.

The invention also includes a pure DNA segment characterized as including
a sequence which hybridizes under high stringency conditions [e.g., as described in
Sambrook et al., Molecular Cloning; A Laboratory Manual. Cold Spring Harbor
Laboratory, Cold Spring Harbor, N.Y. 1989] to a nucleic acid probe which
includes at least fifteen, preferably at least fifty, more preferably at least one hundred,
even more preferably at least two hundred, even more preferably at least five
hundred successive nucleotides of SEQ ID NO:1 or SEQ ID NO:2. Alternatively,
the DNA segment of the invention may be characterized as being capable of
hybridizing under low-stringency conditions to a nucleic acid probe which includes
the coding sequence (nucleotides 166 through 1152) of SEQ ID NO:1 or SEQ ID
NO:2. An example of such low-stringency conditions is as described in Sambrook
et al., using a lower hybridization temperature, such as, for example, 20°C below
the temperature employed for high-stringency hybridization conditions, as
described above.

The DNA segment of the invention may also be characterized as being
capable of hybridizing under high-stringency conditions to a nucleic acid probe
which includes the coding sequence (nucleotides 166 through 1152) of SEQ ID
NO:1 or SEQ ID NO:2.
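
As a quick arithmetic check on the coordinates quoted above, the following Python sketch computes the length of the coding region given as nucleotides 166 through 1152. It assumes that the coordinates are 1-based and inclusive, a convention the text does not state explicitly; the function name is illustrative only.

```python
def region_length(first: int, last: int) -> int:
    """Length of a region given 1-based, inclusive coordinates (assumed convention)."""
    if last < first:
        raise ValueError("last must not be smaller than first")
    return last - first + 1


# Coordinates of the coding sequence as quoted in the text.
cds_len = region_length(166, 1152)

print(cds_len)            # 987
print(cds_len % 3 == 0)   # True: the region is a whole number of triplets
print(cds_len // 3)       # 329
```

The sketch makes no claim about the length of the encoded polypeptide, since the text here does not say whether the final triplet is a stop codon.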

The invention also includes a synthetically produced oligonucleotide (e.g.,
oligodeoxyribonucleotide or oligoribonucleotide and analogs thereof) capable of
hybridizing with at least ten-nucleotide segments of SEQ ID NO:1 or SEQ ID
NO:2.

In another embodiment, the present invention relates to an RNA segment
coding for a polypeptide comprising an amino acid sequence corresponding to
Haematococcus pluvialis crtO gene and allelic and species variations and
functional naturally occurring and/or man-induced variants thereof. In a preferred
embodiment the RNA segment comprises the sequence set forth in SEQ ID NO:2.
In another preferred embodiment, the RNA segment encodes the amino acid
sequence set forth in SEQ ID NO:4.

The invention also includes a pure RNA characterized as including a
sequence which hybridizes under high-stringency conditions to a nucleic acid probe
which includes at least fifteen, preferably at least fifty, more preferably at
least one hundred, even more preferably at least two hundred, even more preferably at
least five hundred successive nucleotides of SEQ ID NO:1 or SEQ ID NO:2.
Alternatively, the RNA of the invention may be characterized as being capable of
hybridizing under low-stringency conditions to a nucleic acid probe which includes
the coding sequence (nucleotides 166 through 1152) of SEQ ID NO:1 or SEQ ID
NO:2. Additionally, the RNA of the invention may be characterized as being
capable of hybridizing under high-stringency conditions to a nucleic acid probe
which includes the coding sequence (nucleotides 166 through 1152) of SEQ ID
NO:1 or SEQ ID NO:2.

In another embodiment, the present invention relates to a polypeptide
comprising an amino acid sequence corresponding to a Haematococcus pluvialis
crtO gene and allelic, species variations and functional naturally occurring and/or
man-induced variants thereof. In a preferred embodiment, the polypeptide
comprises the amino acid sequence set forth in SEQ ID NO:4.

It should be noted that the invention includes any peptide which is
homologous (i.e., 80-85%, preferably 85-90%, more preferably 90-100% of
identical amino acids) to the above described polypeptide. The term 'homologous'
as used herein and in the claims below, refers to the sequence identity between two
peptides. When a position in both of the two compared sequences is occupied by
identical amino acid monomeric subunits, it is homologous at that position. The
homology between two sequences is a function of the number of homologous
positions shared by the two sequences. For example, if eight of ten of the positions
in two sequences are occupied by identical amino acids then the two sequences are
80% homologous.
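
The definition of homology given above can be illustrated by a minimal Python sketch. It assumes two already aligned sequences of equal length and does not attempt to reproduce the GCG software or its treatment of gaps; the function name and the two ten-residue sequences are arbitrary illustrations and are not taken from the sequence listing.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions occupied by identical residues.

    Assumes the two sequences are already aligned and of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Arbitrary example: eight of ten positions are identical.
print(percent_identity("MKTAYIAKQR", "MKTAYIAKAA"))  # 80.0
```

Under this definition the 80-85%, 85-90% and 90-100% ranges recited above correspond to the value returned by such a function for a given pair of aligned peptides.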

Other polypeptides which are also included in the present invention are
allelic variations, other species homologs, natural mutants, induced mutants and
peptides encoded by DNA that hybridizes under high or low stringency conditions
(see above) to the coding region (nucleotides 166 through 1152) of SEQ ID NO:1
or SEQ ID NO:2.

In another embodiment, the present invention relates to a recombinant DNA
molecule comprising a vector (for example plasmid or viral vector) and a DNA
segment coding for a polypeptide, as described above. In a preferred embodiment,
the DNA segment is present in the vector operably linked to a promoter.

In a further embodiment, the present invention relates to a host cell
containing the above described recombinant DNA molecule or DNA segment.
Suitable host cells include prokaryotes (such as bacteria, including Escherichia
coli) and both lower eukaryotes (for example yeast) and higher eukaryotes (for
example, algae, plant or animal cells). Introduction of the recombinant molecule
into the cell can be effected using methods known in the art such as, but not
limited to, transfection, transformation, micro-injection, gene bombardment etc.
The cell thus made to contain the above described recombinant DNA molecules
may be grown to form colonies or may be made to differentiate to form a
differentiated organism. The recombinant DNA molecule may be transiently
contained (e.g., by a process known in the art as transient transfection) in the cell;
nevertheless, it is preferred that the recombinant DNA molecule is stably contained
(e.g., by a process known in the art as stable transfection) in the cell. Yet in a
preferred embodiment the cell is endogenously producing, or is made by genetic
engineering means to produce, β-carotene, and the cell contains endogenous or
genetically engineered β-carotene hydroxylase activity. Such a cell may be used
as a food additive for animal (e.g., salmon) and human consumption. Furthermore,
such a cell may be used for extracting astaxanthin and/or other xanthophylls, as
described hereinbelow.

In a further embodiment, the present invention relates to a host transgenic
organism (e.g., a higher plant or animal) containing the above described
recombinant DNA molecule or the above described DNA segment in its cells.
Introduction of the recombinant molecule or the DNA segment into the host
transgenic organism can be effected using methods known in the art. Yet, in a
preferred embodiment the host organism is endogenously producing, or is made by
genetic engineering means to produce, β-carotene and, also preferably the host
organism contains endogenous or genetically engineered β-carotene hydroxylase
activity. Such an organism may be used as a food additive for animal (e.g.,
salmon) and human consumption. Furthermore, such an organism may be used for
extracting astaxanthin and/or other xanthophylls, as described hereinbelow.

In another embodiment, the present invention relates to a method of
producing astaxanthin using the above described host cell or transgenic organism.
In yet another embodiment, the present invention relates to a method of producing
xanthophylls such as canthaxanthin, echinenone, cryptoxanthin, isocryptoxanthin,
hydroxyechinenone, zeaxanthin, adonirubin, 3-hydroxyechinenone,
3'-hydroxyechinenone and/or adonixanthin using the above described host cell or
transgenic organism. For these purposes provided is a cell or a transgenic
organism as described above. The host cell or organism is made to grow under
conditions favorable for producing astaxanthin and the above listed additional
xanthophylls, which are then extracted by methods known in the art.

In yet another embodiment, the present invention relates to a transgenic
plant expressing a transgene coding for a polypeptide including an amino acid
sequence corresponding to Haematococcus pluvialis crtO gene, allelic and species
variants or functional naturally occurring or man-induced variants thereof.
Preferably the expression is highest in chromoplasts-containing tissues.

In yet another embodiment, the present invention relates to a recombinant
DNA vector which includes a first DNA segment encoding a polypeptide for
directing a protein into plant chloroplasts or chromoplasts (e.g., derived from the
Pds gene of tomato) and an in frame second DNA segment encoding a polypeptide
including an amino acid sequence corresponding to Haematococcus pluvialis crtO
gene, allelic and species variants or functional naturally occurring and
man-induced variants thereof.

In yet another embodiment, the present invention relates to a recombinant
DNA vector which includes a first DNA segment including a promoter highly
expressible in plant chloroplasts or chromoplasts-containing tissues (e.g., derived
from the Pds gene of tomato) and a second DNA segment encoding a polypeptide
including an amino acid sequence corresponding to Haematococcus pluvialis crtO
gene, allelic and species variants or functional naturally occurring and
man-induced variants thereof.

Reference is now made to the following examples, which together with the
above descriptions, illustrate the invention.

EXAMPLES 

The following protocols and experimental details are referenced in the 
Examples that follow: 

Algae and growth conditions. Haematococcus pluvialis (strain 34/7 from
the Culture Collection of Algae and Protozoa, Windermere, UK) was kindly
provided by Dr. Andrew Young from the Liverpool John Moores University.
Suspension cultures of the alga were grown in a liquid medium as described by
Nichols and Bold [see, Nichols HW, Bold HC (1964) Trichosarcina polymorpha
gen et sp nov J Phycol 1: 34-39]. For induction of astaxanthin biosynthesis, cells
were harvested, washed in water and resuspended in a nitrogen-depleted medium.
The cultures were maintained in 250 ml Erlenmeyer flasks under continuous light
(photon flux of 75 µE/m²/s), at 25°C, on a rotary shaker at 80 rpm.

Construction of cDNA library. The construction of a cDNA library from
Haematococcus pluvialis was described in detail by Lotan and Hirschberg (1995)
FEBS letters 364: 125-128. Briefly, total RNA was extracted from algal cells
grown for 5 days under nitrogen-depleted conditions (cell color brown-red). Cells
from a 50 ml culture were harvested and their RNA content was extracted using
Tri reagent (Molecular Research Center, INC.). Poly-A RNA was isolated by
two cycles of fractionation on oligo dT-cellulose (Boehringer). The final yield was
1.5% of the total RNA. The cDNA library was constructed in a Uni-ZAP™ XR
vector, using a ZAP-cDNA synthesis kit (both from Stratagene). Escherichia coli
cells of strain XL1-Blue MRF' (Stratagene) were used for amplification of the
cDNA library.

Plasmids and Escherichia coli strains. Plasmid pPL376, which contains
the genes necessary for carotenoid biosynthesis in the bacterium Erwinia herbicola,
was obtained from Tuveson [for further details regarding plasmid pPL376 see,
Tuveson RW, Larson RA & Kagan J (1988) Role of cloned carotenoid genes
expressed in Escherichia coli in protecting against inactivation by near-UV light
and specific phototoxic molecules. J Bacteriol 170: 4675-4680]. Cells of
Escherichia coli strain JM109 that carry the plasmid pPL376 accumulate the bright
yellow carotenoid, zeaxanthin glycoside. In a first step, a 1.1 kb SalI-SalI
fragment was deleted from this plasmid to inactivate the gene crtX, coding for
zeaxanthin glucosyl transferase. In a second step, partial BamHI cleavage of the
plasmid DNA, followed by self ligation, deleted a 0.8 kb fragment which
inactivated crtZ, encoding β-carotene hydroxylase. A partial BglII cleavage
generated a fragment of 7.4 kb which was cloned in the BamHI site of the plasmid
vector pACYC184. As shown in Figure 4, the resulting recombinant plasmid,
which carried the genes crtE, crtB, crtI and crtY, was designated pBCAR [Lotan
and Hirschberg (1995) FEBS letters 364: 125-128].

Plasmid pBCAR was transfected into SOLR strain cells of Escherichia coli
(Stratagene). Colonies that appeared on chloramphenicol-containing Luria Broth
(LB) medium [described in Sambrook et al., Molecular Cloning; A Laboratory
Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 1989] carried
this plasmid and developed a deep yellow-orange color due to the accumulation of
β-carotene.

As shown in Figure 5, an additional plasmid, designated pZEAX, which
allows for zeaxanthin synthesis and accumulation in Escherichia coli, was
constructed [this plasmid is described in detail in Lotan and Hirschberg (1995)
FEBS letters 364: 125-128]. SOLR strain Escherichia coli cells were used as a
host for the pZEAX plasmid. Escherichia coli cells were grown on LB medium
(see above), at 37°C in the dark on a rotary shaker at 225 rpm. Ampicillin
(50 µg/ml) and/or chloramphenicol (30 µg/ml) (both from Sigma) were added to the
medium for selection of appropriate transformed cells.
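
The working concentrations quoted above can be converted into pipetting volumes with a simple dilution calculation. The following Python sketch is illustrative only: the stock concentrations (100 mg/ml ampicillin, 30 mg/ml chloramphenicol) and the 500 ml medium volume are assumptions chosen for the example and are not stated in the text, and the function name is hypothetical.

```python
def stock_volume_ul(final_ug_per_ml: float, medium_ml: float,
                    stock_mg_per_ml: float) -> float:
    """Volume of stock solution (in µl) needed to reach a final concentration.

    Uses mass balance: total mass needed divided by stock concentration.
    Note that 1 mg/ml is the same as 1 µg/µl.
    """
    if stock_mg_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    total_ug = final_ug_per_ml * medium_ml   # mass of antibiotic required
    stock_ug_per_ul = stock_mg_per_ml        # unit conversion, same number
    return total_ug / stock_ug_per_ul


MEDIUM_ML = 500.0  # assumed batch volume, not from the text

# Final concentrations from the text; stock concentrations are assumptions.
print(stock_volume_ul(50.0, MEDIUM_ML, 100.0))  # ampicillin: 250.0 µl
print(stock_volume_ul(30.0, MEDIUM_ML, 30.0))   # chloramphenicol: 500.0 µl
```

The added stock volume is neglected relative to the medium volume, which is a reasonable approximation at these dilutions.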

As shown in Figure 6, a plasmid, pHPK, containing the full length cDNA of
the β-carotene C-4-oxygenase enzyme was identified by color complementation as
described by Lotan and Hirschberg (1995) FEBS letters 364: 125-128 (see
description herein below). A 1.2 kb PstI-PstI DNA fragment, containing the
cDNA of the β-C-4-oxygenase from Haematococcus pluvialis, was isolated from
plasmid pHPK and inserted into a PstI site in the coding sequence of the crtZ gene
in the plasmid pZEAX. This recombinant plasmid was designated pCANTHA and
is shown in Figure 7.

The same 1.2 kb PstI-PstI fragment was also inserted into a PstI site which
exists 600 bp downstream of the crtE gene in the plasmid pZEAX. The resulting
recombinant plasmid was designated pASTA and is shown in Figure 8.

The same 1.2 kb PstI-PstI fragment was also inserted into a PstI site which
exists in the β-lactamase gene in the plasmid pPAN35D5 [Hirschberg J, Ohad N,
Pecker I and Rahat A (1987) Isolation and characterization of herbicide resistant
mutants in the cyanobacterium Synechococcus R2. Z. Naturforsch 42c: 102-112],
which carries the psbAI gene from the cyanobacterium Synechococcus PCC7942 in
the plasmid vector pBR328 [Hirschberg J, Ohad N, Pecker I and Rahat A (1987)
Isolation and characterization of herbicide resistant mutants in the cyanobacterium
Synechococcus R2. Z. Naturforsch 42c: 102-112]. This plasmid was designated
PAN3.5-KETO and is shown in Figure 9. This plasmid was used in the
transformation of Synechococcus PCC7942 cells following procedures described
by Golden [Golden SS (1988) Mutagenesis of cyanobacteria by classical and
gene-transfer-based methods. Methods Enzymol 167: 714-727].

Excision of phage library and screening for a β-carotene oxygenase
gene. Mass excision of the cDNA library, which was prepared as described
hereinabove, was carried out using the ExAssist helper phage (Stratagene) in cells
of SOLR strain of Escherichia coli that carried the plasmid pBCAR. The excised
library in phagemid form was transfected into Escherichia coli cells of strain
XL1-Blue and the cells were plated on LB plates containing 1 mM
isopropylthio-β-D-galactoside (IPTG), 50 µg/ml ampicillin and 30 µg/ml chloramphenicol, at a
density that yielded approximately 100-150 colonies per plate. The plates were
incubated at 37°C overnight and further incubated for two more days at room
temperature. The plates were then kept at 4°C until screened for changes in colony
colors.

A plasmid for high expression of crtO in chromoplasts. As shown in
Figures 10-11, a genomic DNA sequence of a tomato species Lycopersicon
esculentum (nucleotides 1 to 1448 of the Pds gene [as published in Mann V,
Pecker I and Hirschberg J (1994) cloning and characterization of the gene for
phytoene desaturase (Pds) from tomato (Lycopersicon esculentum). Plant
Molecular Biology 24: 429-434]), which contains the promoter of the Pds gene
and the coding sequence for the amino terminus region of the polypeptide PDS
that serves as a transit peptide for import into chloroplasts and chromoplasts,
was cloned into a HindIII-SmaI site of the binary plasmid vector pBIB
[described by Becker D (1990) Binary vectors which allow the exchange of
plant selectable markers and reporter genes. Nucleic Acids Research 18:230],
shown in Figure 10. The recombinant plasmid was designated pPTBIB and is
shown in Figure 11.

As shown in Figure 12, a 1,110 nucleotide long Eco47III-NcoI fragment,
containing the cDNA of crtO from H. pluvialis (nucleotides 211 to 1321 of
SEQ ID NO:1) was sub-cloned into the SmaI site of the plasmid pPTBIB
(Figure 11) so that the coding nucleotide sequence of the amino terminus of
Pds is in the same reading frame as crtO. The recombinant plasmid was
designated pPTCRTOBIB.
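
The reading-frame requirement described above can be partly checked by arithmetic on the coordinates given in the text. The following Python sketch is a hedged illustration only: it assumes that nucleotide 166 is the first base of the crtO coding sequence and that nucleotide 211 is the first base retained in the sub-cloned fragment, and it says nothing about the Pds side of the junction, which would have to be examined separately. The function name is hypothetical.

```python
CDS_START = 166        # first nucleotide of the crtO coding sequence (from the text)
FRAGMENT_START = 211   # first nucleotide of the sub-cloned fragment (from the text)


def frame_offset(cds_start: int, position: int) -> int:
    """Offset (0, 1 or 2) of a position relative to the codon grid of the CDS."""
    return (position - cds_start) % 3


offset = frame_offset(CDS_START, FRAGMENT_START)
print(offset)                               # 0: fragment starts on a codon boundary
print((FRAGMENT_START - CDS_START) // 3)    # 15 complete codons precede the fragment
```

An offset of zero indicates only that the fragment begins at the first base of a crtO codon under the stated assumptions; an in-frame fusion additionally requires that the upstream PDS coding sequence ends on a codon boundary at the junction.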

Formation of transgenic higher plant. The DNA of pPTCRTOBIB
was extracted from E. coli cells and was transferred into cells of
Agrobacterium tumefaciens strain EHA105 [described by Hood EE, Gelvin SB,
Melchers LS and Hoekema A (1993) Transgenic Research 2:208-218] using
electroporation as described for E. coli [Dower JW, Miller FJ and Ragsdale
WC (1988) High efficiency transformation of E. coli by high voltage
electroporation. Nuc. Acids Res. 18: 6127-6145]. Agrobacterium cells were
grown at 28°C in LB medium supplemented with 50 µg/ml streptomycin and
50 µg/ml kanamycin as selective agents. Cells of Agrobacterium carrying
pPTCRTOBIB were harvested from a suspension culture at the stationary phase
of growth and used for transformation as described by Horsch RB, Fry JE,
Hoffmann NL, Eicholtz D, Rogers SG and Fraley RT, A simple and general
method for transferring genes into plants. Science (1985) 227:1229-1231; and
Jefferson AR, Kavanagh TA and Bevan WM (1987) GUS fusions:
β-glucuronidase as a sensitive and versatile gene fusion marker in higher plants.
The EMBO J. 6: 3901-3907.

Leaf explants of Nicotiana tabacum strain NN were infected with the
transformed Agrobacterium cells and kanamycin-resistant transgenic plants
were regenerated according to protocols described by Horsch et al. (1985) and
Jefferson et al. (1987) cited above.

With reference now to Figure 13, the presence of the DNA sequence of the
crtO gene-construct in the fully developed regenerated plants was determined by
DNA Southern blot analysis. To this end DNA was extracted from the leaves
[according to a protocol described by Kanazawa and Tsutsumi (1992) Extraction
of restrictable DNA from plants of the genus Nelumbo. Plant Molecular Biology
Reports 10: 316-318] and digested with the endonuclease HindIII; the fragments were
size separated by gel electrophoresis and hybridized with radioactively labeled
crtO sequence (SEQ ID NO:1).
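
The HindIII digestion step described above can be sketched in silico. The following Python example is a simplified illustration: it treats the DNA as a linear top-strand string, uses the known HindIII recognition sequence AAGCTT with cleavage after the first A, and is run on a short toy sequence that is not taken from the sequence listing. The function and variable names are hypothetical.

```python
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
CUT_OFFSET = 1           # HindIII cleaves after the first A: A^AGCTT


def digest_linear(seq: str, site: str = HINDIII_SITE,
                  cut_offset: int = CUT_OFFSET) -> list[int]:
    """Return fragment lengths (in order) after complete digestion of a linear molecule."""
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [bounds[i + 1] - bounds[i] for i in range(len(bounds) - 1)]


# Toy sequence with two HindIII sites (illustrative only).
toy = "GG" + "AAGCTT" + "C" * 10 + "AAGCTT" + "TT"
print(digest_linear(toy))  # [3, 16, 7]
```

In such a model, a fragment bounded by two sites inside the inserted DNA has the same length wherever the insertion occurs, whereas fragments extending into flanking sequence depend on the position of the nearest flanking site.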

It was determined that each transgenic plant that was examined contained at
least one copy of the crtO DNA sequence, yielding a 1.75 kb band (arrow),
originating from an internal HindIII-HindIII fragment of the T-DNA of
pPTCRTOBIB, additional bands originating from partial digestion, additional
band/s whose sizes vary, depending on the position of insertion in the plant
genome and a 1.0 kb band originating from the tobacco plant itself which therefore
also appears in the negative control WT lane.

Sequence analysis. DNA sequence analysis was carried out by the dideoxy
method [see, Sanger F, Nicklen S & Coulson AR (1977) DNA sequencing with
chain termination inhibitors. Proc Natl Acad Sci USA 74: 5463-5467].

Carotenoids analysis. Aliquots of Escherichia coli cells which were
grown in liquid LB medium were centrifuged at 13,000 g for 10 minutes,
washed once in water and re-centrifuged. After removing the water the cells were
resuspended in 70 µl of acetone and incubated at 65°C for 15 minutes. The
samples were centrifuged again at 13,000 g for 10 minutes and the
carotenoid-containing supernatant was placed in a clean tube. The carotenoid
extract was blown to dryness under a stream of nitrogen (N2) gas and stored at
-20°C until required for analysis. Carotenoids from plant tissues were extracted
by mixing 0.5-1.0 g of tissue with 100 µl of acetone followed by incubation at
65°C for 15 minutes and then treating the samples as described above.

High-performance liquid chromatography (HPLC) of the carotenoid
extracts was carried out using an acidified reverse-phase C18 column, Spherisorb
ODS-2 (silica 5 µm, 4.6 mm x 250 mm) (Phenomenex®). The mobile phase was
pumped by triphasic Merck-Hitachi L-6200A high pressure pumps at a flow rate of
1.5 ml/min. The mobile phase consisted of an isocratic solvent system comprised
of hexane/dichloromethane/isopropyl alcohol/triethylamine (88.5:10:1.5:0.1, v/v).
Peaks were detected at 470 nm using a Waters 996 photodiode-array detector.
Individual carotenoids were identified by their retention times and their typical
absorption spectra, as compared to standard samples of chemically pure
β-carotene, zeaxanthin, echinenone, canthaxanthin, adonirubin and astaxanthin
(the latter four were kindly provided by Dr. Andrew Young from Liverpool John
Moores University).
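As a worked illustration of the solvent ratio stated above, the following minimal Python sketch scales the 88.5:10:1.5:0.1 (v/v) proportions to a chosen batch volume. The function name and the 1000 ml batch size are assumptions made for illustration; the text does not describe how the mobile phase was actually prepared.

```python
# Stated proportions of the isocratic mobile phase (parts by volume).
MOBILE_PHASE_PARTS = {
    "hexane": 88.5,
    "dichloromethane": 10.0,
    "isopropyl alcohol": 1.5,
    "triethylamine": 0.1,
}


def component_volumes(total_ml, parts=MOBILE_PHASE_PARTS):
    """Split total_ml among the solvents in proportion to their parts."""
    total_parts = sum(parts.values())
    return {name: total_ml * p / total_parts for name, p in parts.items()}


if __name__ == "__main__":
    # Hypothetical 1000 ml batch, chosen only for the example.
    for name, ml in component_volumes(1000.0).items():
        print(f"{name}: {ml:.1f} ml")
```

Note that the stated parts add up to 100.1 rather than 100, so the sketch normalizes by the sum of the parts instead of treating them as percentages.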

Thin layer chromatography (TLC) was carried out using silica gel 60 F254
plates (Merck), using ethyl acetate/benzene (7:3, v/v) as an eluent. Visible
absorption spectra were recorded with a Shimadzu UV-160A spectrophotometer.
All spectra were recorded in acetone. Spectral fine structure was expressed in
terms of %III/II [Britton, G. (1995). UV/Visible Spectroscopy. In: Carotenoids;
Vol IB, Spectroscopy. Eds. Britton G, Liaaen-Jensen S and Pfander H.
Birkhäuser Verlag, Basel, pp. 13-62].
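The %III/II measure can be sketched in a few lines of Python. The sketch assumes the usual convention for this notation: the heights of the longest-wavelength band (III) and the middle band (II) are both measured from the minimum between the two peaks, and III is expressed as a percentage of II. The function name and the absorbance readings in the usage note are hypothetical and are not data from this work.

```python
def percent_iii_ii(abs_peak_ii, abs_peak_iii, abs_trough):
    """Spectral fine structure expressed as %III/II.

    Peak heights are measured from the trough (the minimum between
    peaks II and III). A spectrum without a resolved peak III gives 0.
    """
    height_ii = abs_peak_ii - abs_trough
    height_iii = abs_peak_iii - abs_trough
    if height_ii <= 0 or height_iii <= 0:
        return 0.0
    return 100.0 * height_iii / height_ii
```

For example, hypothetical readings of 1.000 (peak II), 0.865 (peak III) and 0.815 (trough) give a value of about 27, whereas a single rounded band with no resolved peak III gives 0.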

Isolation and identification of the carotenoids extracted from cells of E. coli
are treated in order of increasing adsorption (decreasing Rf values) on silica TLC
plates. Carotenoid structures and the biosynthesis pathway of astaxanthin are
given in Figure 14. The following details refer to the carotenoids numbered 1
through 9 in Figure 14.

β-Carotene (1). Rf 0.92, inseparable from authentic (1). Rt . VIS λmax nm: (428), 452, 457, %III/II = 0.
Echinenone (2). Rf 0.90, inseparable from authentic (2). Rt . VIS λmax nm: 455. %III/II = 0.
Canthaxanthin (3). Rf 0.87, inseparable from authentic (3). Rt . VIS λmax nm: 470. %III/II = 0.
β-Cryptoxanthin (4). Rf 0.83. Rt . VIS λmax nm: (428), 451, 479, %III/II = 0.
Adonirubin (5). Rf 0.82, inseparable from authentic (5). Rt . VIS λmax nm: 476, %III/II = 0.
Astaxanthin (6). Rf 0.79, inseparable from authentic (6). Rt . VIS λmax nm: 477, %III/II = 0.
Adonixanthin (7). Rf 0.72. Rt . VIS λmax nm: 464, %III/II = 0.
Zeaxanthin (8). Rf 0.65, inseparable from authentic (8). Rt . VIS λmax nm: (428), 451, 483, %III/II = 27.
Hydroxyechinenone (9). Rf 0.80, Rt 3.0. VIS λmax nm: 464, %III/II = 0.
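The ordering by adsorption described above can be illustrated with a short Python sketch that sorts the reported Rf values from least to most strongly adsorbed. The dictionary keys, the function names and the 100 mm solvent front in the usage note are assumptions made for illustration.

```python
# Rf values as reported above for silica TLC (ethyl acetate/benzene 7:3).
RF_VALUES = {
    "beta-carotene": 0.92,
    "echinenone": 0.90,
    "canthaxanthin": 0.87,
    "beta-cryptoxanthin": 0.83,
    "adonirubin": 0.82,
    "hydroxyechinenone": 0.80,
    "astaxanthin": 0.79,
    "adonixanthin": 0.72,
    "zeaxanthin": 0.65,
}


def by_increasing_adsorption(rf_values):
    """Return names sorted from highest Rf (least adsorbed) to lowest."""
    return sorted(rf_values, key=rf_values.get, reverse=True)


def migration_mm(rf, solvent_front_mm):
    """Distance travelled by a spot for a given solvent-front distance."""
    return rf * solvent_front_mm


if __name__ == "__main__":
    for name in by_increasing_adsorption(RF_VALUES):
        print(f"{name}: Rf {RF_VALUES[name]:.2f}")
```

With a hypothetical solvent front of 100 mm, the least polar compound (β-carotene) would run to about 92 mm and the most polar one listed (zeaxanthin) to about 65 mm.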
Chirality configuration. Chirality configuration of astaxanthin was
determined by HPLC of the derived diastereoisomeric camphanates of the
astaxanthin [Renstrom B, Borch G, Skulberg M and Liaaen-Jensen S (1981)
Optical purity of (3S,3'S)-astaxanthin from Haematococcus pluvialis. Phytochem
20: 2561-2565]. The analysis proved that the Escherichia coli cells synthesize
pure (3S,3'S) astaxanthin.

EXAMPLE 1 

Cloning the β-C-4-oxygenase gene

A cDNA library was constructed in Lambda ZAP II vector from poly-A
RNA of Haematococcus pluvialis cells that had been induced to synthesize
astaxanthin by nitrogen deprivation as described hereinabove. The entire library
was excised into β-carotene-accumulating cells of Escherichia coli, strain SOLR,
which carried plasmid pBCAR (shown in Figure 4). Screening for a β-carotene
oxygenase gene was based on color visualization of colonies 3 mm in
diameter. Astaxanthin and other oxygenated forms of β-carotene (i.e.,
xanthophylls) have distinct darker colors and thus can be detected against the
yellow β-carotene background. The screening included approximately 100,000
colonies which were grown on LB medium plates containing ampicillin and
chloramphenicol that selected for both the Lambda ZAP II vector in its plasmid
propagating form and the pBCAR plasmid. Several colonies showed different
color tones but only one exhibited a conspicuous brown-red pigment. This colony,
presumed to contain a xanthophyll biosynthesis gene, was selected for further
analysis described hereinbelow in the following Examples.

EXAMPLE 2 

Analysis of the β-C-4-oxygenase activity in Escherichia coli

The red-brown colony presumed to contain a xanthophyll biosynthesis gene
(see Example 1 above) was streaked and further analyzed. First, the recombinant
ZAP II plasmid carrying the cDNA clone that was responsible for xanthophyll
synthesis in Escherichia coli was isolated by preparing plasmid DNA from the
red-brown colony, transfecting it into Escherichia coli cells of the strain
XL1-Blue and selecting on ampicillin-containing medium. This plasmid, designated
pHPK (pHPK is a Lambda ZAP II vector containing an insert isolated from the
red-brown colony), was used to transform β-carotene-producing Escherichia coli
cells (Escherichia coli SOLR strain carrying the plasmid pBCAR shown in Figure 4),
resulting in the formation of red-brown colonies. Carotenoids from this
transformant, as well as from the host cells (as control), were extracted with
acetone and analyzed by HPLC.

HPLC analysis of carotenoids of the host bacteria which synthesized
β-carotene (Escherichia coli SOLR strain carrying the plasmid pBCAR shown in
Figure 4), as compared with a brown-red colony, revealed that only traces of
β-carotene were observed in the transformant cells while a new major peak of
canthaxanthin and another minor peak of echinenone appeared [described in detail
by Lotan and Hirschberg (1995) FEBS Letters 364: 125-128]. These results
indicate that the cDNA in plasmid pHPK, designated crtO, encodes an enzyme with
β-C-4-oxygenase activity, which converts β-carotene to canthaxanthin via
echinenone (see Figure 14). It is therefore concluded that a single enzyme
catalyzes this two-step ketonization conversion by acting symmetrically on the 4
and 4' carbons of the β- and β'-rings of β-carotene, respectively.

EXAMPLE 3 
Production of astaxanthin in Escherichia coli cells 

To determine whether β-carotene hydroxylase (e.g., a product of the crtZ
gene of Erwinia herbicola) can convert thus produced canthaxanthin to astaxanthin
and/or whether zeaxanthin converted from β-carotene by β-carotene hydroxylase
can be converted by β-C-4-oxygenase to astaxanthin, the crtO cDNA of
Haematococcus pluvialis thus isolated was expressed in Escherichia coli cells
together with the crtZ gene of Erwinia herbicola. For this purpose, Escherichia
coli cells of strain SOLR were transfected with either plasmid pASTA alone,
containing, as shown in Figure 8, both crtZ and crtO, or alternatively with both
plasmids pHPK, containing crtO as shown in Figure 6, and pZEAX, containing
crtZ as shown in Figure 5. Carotenoids in the resulting transformed cells were
extracted and analyzed by HPLC as described above. The results, given in Table
1, show the composition of carotenoids extracted from the cells containing the
plasmid pASTA. A similar carotenoid composition is found in Escherichia coli
cells which carry both pHPK and pZEAX.

TABLE 1

Carotenoid            % of total carotenoid composition

β-Carotene               8.0
Echinenone               1.7
β-Cryptoxanthin          4.2
Canthaxanthin            4.2
Zeaxanthin              57.8
Adonirubin               1.0
Adonixanthin            17.9
Astaxanthin              5.2



The results presented in Table 1 prove that carotenoids possessing either a
β-end group or a 4-keto-β-end group act as substrates for the hydroxylation
reactions catalyzed by the crtZ gene product at carbons C-3 and C-3'. The
hydroxylation of β-carotene and canthaxanthin results in the production of
zeaxanthin and astaxanthin, respectively. These hydroxylations result in the
production of astaxanthin and the intermediate ketocarotenoids,
3-hydroxyechinenone, adonixanthin and adonirubin. These results further
demonstrate that astaxanthin can be produced in heterologous cells by expressing
the gene crtO together with a gene that codes for a β-carotene hydroxylase.
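To make the reading of Table 1 concrete, the following Python sketch groups the tabulated carotenoids by the functional groups they carry and sums their shares. The percentages are those of Table 1; the counts of 4-keto and 3-hydroxy groups per molecule are standard structural assignments supplied here as an assumption, and all names in the code are illustrative.

```python
# Table 1 values: % of total carotenoids in E. coli cells carrying pASTA.
TABLE_1 = {
    "beta-carotene": 8.0,
    "echinenone": 1.7,
    "beta-cryptoxanthin": 4.2,
    "canthaxanthin": 4.2,
    "zeaxanthin": 57.8,
    "adonirubin": 1.0,
    "adonixanthin": 17.9,
    "astaxanthin": 5.2,
}

# (number of 4-keto groups, number of 3-hydroxy groups) per molecule.
# Standard structural assignments, not values taken from Table 1.
FUNCTIONAL_GROUPS = {
    "beta-carotene": (0, 0),
    "echinenone": (1, 0),
    "beta-cryptoxanthin": (0, 1),
    "canthaxanthin": (2, 0),
    "zeaxanthin": (0, 2),
    "adonirubin": (2, 1),
    "adonixanthin": (1, 2),
    "astaxanthin": (2, 2),
}

KETO, HYDROXY = 0, 1


def share_with_group(table, group_index):
    """Sum the percentages of carotenoids carrying at least one such group."""
    return sum(
        pct for name, pct in table.items()
        if FUNCTIONAL_GROUPS[name][group_index] > 0
    )


keto_share = share_with_group(TABLE_1, KETO)
hydroxy_share = share_with_group(TABLE_1, HYDROXY)
print(f"4-keto carotenoids:    {keto_share:.1f}%")
print(f"3-hydroxy carotenoids: {hydroxy_share:.1f}%")
```

On these figures roughly 30% of the pool carries at least one 4-keto group and roughly 86% carries at least one 3-hydroxy group, which is in line with zeaxanthin being the dominant product in these cells.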

EXAMPLE 4

Sequence analysis of the gene for β-carotene C-4-oxygenase

The full length, as was determined by the presence of a poly A tail, of the
cDNA insert in plasmid pHPK (1771 base pairs) was subjected to nucleotide
sequence analysis. This sequence, set forth in SEQ ID NO:1, and its translation to
an amino acid sequence set forth in SEQ ID NO:3 (329 amino acids), were
deposited in EMBL database on May 1, 1995, and obtained the EMBL accession
numbers X86782 and X86783, respectively.
An open reading frame (ORF) of 987 nucleotides (nucleotides 166 through
1152 in SEQ ID NO:3) was identified in this sequence. This ORF codes for the
enzyme β-carotene C-4-oxygenase having 329 amino acids set forth in SEQ ID
NO:4, as proven by its functional expression in Escherichia coli cells (see
Example 3 above). The gene for this enzyme was designated crtO.
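The relationship between the stated ORF coordinates and the length of the encoded peptide can be checked with a short Python sketch. It uses the stated coordinates (166 through 1152, 1-based and inclusive) and the listing row covering nucleotides 145 to 192, which contains the start codon. The codon table is deliberately partial, covering only the codons in this fragment, and all function and variable names are illustrative assumptions.

```python
ORF_START, ORF_END = 166, 1152  # 1-based, inclusive, as stated in the text


def orf_length(start, end):
    """Number of nucleotides in an inclusive 1-based interval."""
    return end - start + 1


# Nucleotides 145-192 of the sequence listing (row containing the start codon).
ROW_START = 145
ROW_145_192 = (
    "TGC CAT AGC ACA GCT AGA CGA ATG CAG CTA GCA GCG ACA GTA ATG TTG"
).replace(" ", "")

# Partial codon table: only the codons needed for this fragment.
CODON_TABLE = {
    "ATG": "Met", "CAG": "Gln", "CTA": "Leu", "GCA": "Ala",
    "GCG": "Ala", "ACA": "Thr", "GTA": "Val", "TTG": "Leu",
}


def translate_from(seq, offset):
    """Translate complete codons of seq starting at a 0-based offset."""
    codons = [seq[i:i + 3] for i in range(offset, len(seq) - 2, 3)]
    return [CODON_TABLE[codon] for codon in codons]


n_nt = orf_length(ORF_START, ORF_END)
print(n_nt, "nucleotides =", n_nt // 3, "codons")
print(" ".join(translate_from(ROW_145_192, ORF_START - ROW_START)))
```

The interval corresponds to 329 codons, matching the 329-residue peptide, and the fragment translates to the first nine residues of the protein beginning with Met.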


EXAMPLE 5 
Transformation of cyanobacteria with crtO 

The plasmid DNA of pPAN3.5-KETO, shown in Figure 9, was transfected
into cells of the cyanobacterium Synechococcus PCC7942 according to the method
described by Golden [Golden SS (1988) Mutagenesis of cyanobacteria by classical
and gene-transfer-based methods. Methods Enzymol 167: 714-727]. The
cyanobacterial cells were plated on petri dishes containing BG11 medium
supplemented with chloramphenicol. Colonies of chloramphenicol-resistant
Synechococcus PCC7942 which appeared after ten days were analyzed for their
carotenoid content. As detailed in Table 2 below, HPLC analysis of these cells
revealed that the major carotenoid components of the cells were β-carotene,
echinenone, canthaxanthin, adonirubin and astaxanthin. A similar analysis of the
wild type strain and of Synechococcus PCC7942 transfected with a plasmid in
which the orientation of the crtO gene is reversed (not shown), which is therefore
not capable of producing an active protein, did not reveal production of
echinenone, canthaxanthin, adonirubin and astaxanthin.

These results prove that crtO of Haematococcus pluvialis can be expressed
in cyanobacteria and that its expression provides the β-C-4-oxygenase enzymatic
activity needed for the conversion of β-carotene to canthaxanthin. This result
further demonstrates that the endogenous β-carotene hydroxylase of
Synechococcus PCC7942 is able to convert thus produced canthaxanthin to
astaxanthin. Since the carotenoid biosynthesis pathway is similar in all green
photosynthetic organisms [see Figures 1 and 10 and Pecker I, Chamovitz D, Linden
H, Sandmann G and Hirschberg J (1992) A single polypeptide catalyzing the
conversion of phytoene to ζ-carotene is transcriptionally regulated during tomato
fruit ripening. Proc Natl Acad Sci USA 89: 4962-4966], it is deduced that
astaxanthin can be produced in algae and higher plants by expressing crtO in any
tissue that also expresses the endogenous β-carotene hydroxylase. It is further
deduced that astaxanthin can be produced by any organism, provided it contains
either an endogenous or an engineered β-carotene biosynthesis pathway, by
expressing crtO in any tissue that expresses either an endogenous or a
genetically engineered β-carotene hydroxylase.

TABLE 2

Carotenoid            % of total carotenoid composition

β-Carotene              31.5
Echinenone              18.5
Canthaxanthin           16.1
Zeaxanthin              22.3
Adonirubin               6.0
Astaxanthin              5.6
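A quick consistency check on Table 2 can be sketched in Python: the entries should add up to about 100%, and the four compounds reported as absent from the wild type and reversed-orientation controls (echinenone, canthaxanthin, adonirubin and astaxanthin) can be summed to give the crtO-dependent share. The tolerance value and all names below are assumptions for illustration.

```python
# Table 2 values: % of total carotenoids in transformed Synechococcus PCC7942.
TABLE_2 = {
    "beta-carotene": 31.5,
    "echinenone": 18.5,
    "canthaxanthin": 16.1,
    "zeaxanthin": 22.3,
    "adonirubin": 6.0,
    "astaxanthin": 5.6,
}

# Compounds not detected in the wild type or reversed-orientation controls.
CRTO_DEPENDENT = {"echinenone", "canthaxanthin", "adonirubin", "astaxanthin"}


def sums_to_100(table, tolerance=0.5):
    """True if the percentages add up to 100 within the given tolerance."""
    return abs(sum(table.values()) - 100.0) <= tolerance


def combined_share(table, names):
    """Total percentage contributed by the named carotenoids."""
    return sum(table[name] for name in names)


print("table complete:", sums_to_100(TABLE_2))
print(f"crtO-dependent share: {combined_share(TABLE_2, CRTO_DEPENDENT):.1f}%")
```

On the tabulated figures, a little under half of the carotenoid pool (about 46%) consists of compounds that appear only when crtO is expressed in the correct orientation.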



EXAMPLE 6 
Determining the chirality configuration of astaxanthin 
produced in heterologous systems 

The chirality configurations of astaxanthin produced by Escherichia coli
cells, as described under Example 3 hereinabove, and by cyanobacterium
Synechococcus PCC7942 cells, as described in Example 5 hereinabove, were
determined by HPLC of the derived diastereoisomeric camphanates of the
astaxanthin [Renstrom B, Borch G, Skulberg M and Liaaen-Jensen S (1981)
Optical purity of (3S,3'S)-astaxanthin from Haematococcus pluvialis. Phytochem
20: 2561-2565]. The analysis proved that the Escherichia coli and Synechococcus
PCC7942 cells described above synthesize pure (3S,3'S) astaxanthin.

EXAMPLE 7 
Transformation of a higher plant with crtO 

Producing natural astaxanthin in higher plants has two anticipated
benefits. First, as a pure chemical, astaxanthin is widely used as a feed additive
for fish. It is a potential food colorant suitable for human consumption and
has potential applications in the cosmetic industry. Second, inducing
astaxanthin biosynthesis in vivo in flowers and fruits will provide attractive
pink/red colors which will enhance their appearance and/or nutritional value.

In flowers and fruits carotenoids are normally synthesized and
accumulated to high concentration in chromoplasts, typical pigment-containing
plastids, thus providing typical intense colors to these organs.
Inducing synthesis of astaxanthin in chromoplasts enables the accumulation of
a high concentration of this ketocarotenoid. Over-expression of carotenoid
biosynthesis genes which results in elevated concentrations of carotenoids in
chloroplasts, or other alterations in carotenoid composition in chloroplasts, may
damage the thylakoid membranes and impair photosynthesis, and thus is
deleterious to the plants. In contrast, an increase of carotenoid concentration
or an alteration in carotenoid composition in chromoplasts does not affect the
viability of the plant or the yield of fruits and flowers.

Thus, gene-transfer technology was used to implant the crtO gene
isolated from the alga Haematococcus pluvialis, as described, into a higher
plant, in such a way that its expression is up-regulated especially in
chromoplast-containing cells.

To this end, a T-DNA containing binary plasmid vector as shown in
Figure 12 was assembled in E. coli from the promoter and coding DNA
sequences of the transit peptide encoded by the Pds gene from a tomato species
Lycopersicon esculentum, linked to the coding DNA sequence of crtO from H.
pluvialis. Upon stable transfer of this DNA construct via
Agrobacterium-mediated transformation into a tobacco (Nicotiana tabacum NN)
plant to form a transgenic plant, as described under methods above, the plant
acquired the ability to produce ketocarotenoids especially in flower tissues
(chromoplast-containing cells). It should be noted that the Pds gene promoter is
capable of directing transcription and therefore expression especially in
chloroplast- and/or chromoplast-containing tissues of plants. It should be
further noted that the transit peptide encoded by part of the Pds coding
sequence is capable of directing conjugated (i.e., in frame) proteins into plant
chromoplasts and/or chloroplasts.

As shown in Figure 15, in chromoplast-containing cells, such as in the
nectary tissue of the flower of tobacco, this DNA construct induces
accumulation of astaxanthin and other ketocarotenoids to a higher level which
alters the color from the normal yellow to red.

Concentration and composition of carotenoids in chloroplast-containing
tissues, such as leaves, and in chromoplast-containing tissues, such as flowers,
were determined in the transgenic plants and compared to normal
non-transformed plants.




Carotenoid compositions in leaves (chloroplast-containing tissue) and
in the nectary tissue of flowers (chromoplast-containing tissue) of wild type and
transgenic tobacco plants were determined by thin layer chromatography (TLC)
and by high pressure liquid chromatography (HPLC) as described above.
Total carotenoid concentrations in leaves (chloroplast-containing
tissue) and in the nectary tissue of flowers (chromoplast-containing tissue) of
wild type and transgenic tobacco plants are summarized in Table 3 below.

The percentage composition of carotenoids in leaves of wild-type and
transgenic tobacco plants is summarized in Table 4 below.
The percentage composition of carotenoids in the nectary tissue of flowers of
wild-type and transgenic tobacco plants is summarized in Table 5 below.



TABLE 3

µg carotenoids per g fresh weight

                                  Wild-type    Transgenic with crtO

Leaf (chloroplasts)                  200              240
Nectary tissue (chromoplasts)        280              360



TABLE 4

% of total carotenoids composition in chloroplasts-containing tissue (leaf)

                              Wild-type    Transgenic

β-carotene                      29.9          26.7
neoxanthin                       5.0           5.9
violaxanthin                    11.6          18.1
antheraxanthin                   4.9           2.6
lutein                          43.9          41.4
zeaxanthin                       4.7           4.3
astaxanthin + adonirubin         0.0           1.0
TABLE 5

% of total carotenoid composition in chromoplasts-containing tissue (flower)

                              Wild-type    Transgenic

beta-carotene                   58.1          21.0
violaxanthin                    40.3           1.5
lutein                           0.0           1.1
zeaxanthin                       1.6           1.0
hydroxyechinenone                0.0          13.7
3'-hydroxyechinenone             0.0           4.1
adonirubin                       0.0          22.4
adonixanthin                     0.0           8.7
astaxanthin                      0.0          26.5



Please note the elevated content of hydroxyechinenone,
3'-hydroxyechinenone, adonirubin, adonixanthin and astaxanthin especially in
the chromoplast-containing tissue of the transgenic tobacco plants.
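The totals of Table 3 and the percentages of Table 5 can be combined to estimate absolute amounts per gram of fresh tissue, as in the following Python sketch. It assumes that the two tables describe the same nectary tissue samples, which the text does not state explicitly; the names used in the code are illustrative.

```python
# Table 3: total carotenoids, micrograms per gram fresh weight.
TOTAL_UG_PER_G = {
    ("leaf", "wild-type"): 200.0,
    ("leaf", "transgenic"): 240.0,
    ("nectary", "wild-type"): 280.0,
    ("nectary", "transgenic"): 360.0,
}

# Table 5, transgenic column: % of total carotenoids in nectary tissue.
NECTARY_TRANSGENIC_PCT = {
    "beta-carotene": 21.0,
    "violaxanthin": 1.5,
    "lutein": 1.1,
    "zeaxanthin": 1.0,
    "hydroxyechinenone": 13.7,
    "3'-hydroxyechinenone": 4.1,
    "adonirubin": 22.4,
    "adonixanthin": 8.7,
    "astaxanthin": 26.5,
}


def absolute_content(total_ug_per_g, percentages):
    """Convert percentage composition into micrograms per gram fresh weight."""
    return {
        name: total_ug_per_g * pct / 100.0
        for name, pct in percentages.items()
    }


def fold_change(tissue):
    """Ratio of transgenic to wild-type total carotenoid content."""
    return (
        TOTAL_UG_PER_G[(tissue, "transgenic")]
        / TOTAL_UG_PER_G[(tissue, "wild-type")]
    )


nectary = absolute_content(
    TOTAL_UG_PER_G[("nectary", "transgenic")], NECTARY_TRANSGENIC_PCT
)
print(f"astaxanthin in transgenic nectary: {nectary['astaxanthin']:.1f} ug/g")
print(f"fold change, leaf:    {fold_change('leaf'):.2f}")
print(f"fold change, nectary: {fold_change('nectary'):.2f}")
```

Under that assumption, astaxanthin amounts to roughly 95 µg per g fresh weight in the transgenic nectary tissue, while total carotenoid content rises by a factor of about 1.2 in leaves and about 1.3 in nectary tissue.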

Thus, the present invention successfully addresses the shortcomings of the
presently known configurations by enabling a relatively low cost biotechnological
production of (3S,3'S) astaxanthin by providing a peptide having a
β-C-4-oxygenase activity; a DNA segment coding for this peptide; an RNA segment
coding for this peptide; a recombinant DNA molecule comprising a vector and the
DNA segment; a host containing the above described recombinant DNA molecule
or DNA segment; and a method for biotechnologically producing (3S,3'S)
astaxanthin or a food additive containing (3S,3'S) astaxanthin, using the host.



While the invention has been described with respect to a limited number of 
embodiments, it will be appreciated that many variations, modifications and other 
applications of the invention may be made. 
SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANTS: Joseph Hirschberg, Tamar Lotan and Mark Harker

    (ii) TITLE OF INVENTION: Polynucleotide molecule from Haematococcus
         pluvialis encoding a polypeptide having a β-C-4-oxygenase activity
         for biotechnological production of (3S,3'S) astaxanthin.

    (iii) NUMBER OF SEQUENCES: 4

    (iv) CORRESPONDENCE ADDRESS:
         (A) ADDRESSEE: Mark M. Friedman c/o Robert Sheinbein
         (B) STREET: 2940 Birchtree space lane
         (C) CITY: Silver Spring
         (D) STATE: Maryland
         (E) COUNTRY: United States of America
         (F) ZIP: 20906

    (v) COMPUTER READABLE FORM:
         (A) MEDIUM TYPE: 1.44 megabyte, 3.5" microdisk
         (B) COMPUTER: Twinhead Stimnote-890TX
         (C) OPERATING SYSTEM: MS DOS version 6.2, Windows version 3.11
         (D) SOFTWARE: Word for Windows version 2.0

    (vi) CURRENT APPLICATION DATA:
         (A) APPLICATION NUMBER:
         (B) FILING DATE:
         (C) CLASSIFICATION:

    (vii) PRIOR APPLICATION DATA:
         (A) APPLICATION NUMBER:
         (B) FILING DATE:

    (viii) ATTORNEY/AGENT INFORMATION:
         (A) NAME: Friedman, Mark M.
         (B) REGISTRATION NUMBER: 33,883
         (C) REFERENCE/DOCKET NUMBER: 325/5

    (ix) TELECOMMUNICATION INFORMATION:
         (A) TELEPHONE: 972-3-5625553
         (B) TELEFAX: 972-3-5625554
         (C) TELEX:

(2) INFORMATION FOR SEQ ID NO:1:

    (i) SEQUENCE CHARACTERISTICS:
         (A) LENGTH: 1771 base pairs
         (B) TYPE: nucleic acid
         (C) STRANDEDNESS: double
         (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



GGC ACG AGC TTG CAC GCA AGT CAG CGC GCG CAA GTC AAC ACC TGC CGG    48
TCC ACA GCC TCA AAT AAT AAA GAG CTC AAG CGT TTG TGC GCC TCG ACG    96
TGG CCA GTC TGC ACT GCC TTG AAC CCG CGA GTC TCC CGC CGC ACT GAC   144
TGC CAT AGC ACA GCT AGA CGA ATG CAG CTA GCA GCG ACA GTA ATG TTG   192
GAG CAG CTT ACC GGA AGC GCT GAG GCA CTC AAG GAG AAG GAG AAG GAG   240
GTT GCA GGC AGC TCT GAC GTG TTG CGT ACA TGG GCG ACC CAG TAC TCG   288
CTT CCG TCA GAA GAG TCA GAC GCG GCC CGC CCG GGA CTG AAG AAT GCC   336
TAC AAG CCA CCA CCT TCC GAC ACA AAG GGC ATC ACA ATG GCG CTA CGT   384
GTC ATC GGC TCC TGG GCC GCA GTG TTC CTC CAC GCC ATT TTT CAA ATC   432
AAG CTT CCG ACC TCC TTG GAC CAG CTG CAC TGG CTG CCC GTG TCA GAT   480
GCC ACA GCT CAG CTG GTT AGC GGC ACG AGC AGC CTG CTC GAC ATC GTC   528
GTA GTA TTC TTT GTC CTG GAG TTC CTG TAC ACA GGC CTT TTT ATC ACC   576
ACG CAT GAT GCT ATG CAT GGC ACC ATC GCC ATG AGA AAC AGG CAG CTT   624
AAT GAC TTC TTG GGC AGA GTA TGC ATC TCC TTG TAC GCC TGG TTT GAT   672
TAC AAC ATG CTG CAC CGC AAG CAT TGG GAG CAC CAC AAC CAC ACT GGC   720
GAG GTG GGC AAG GAC CCT GAC TTC CAC AGG GGA AAC CCT GGC ATT GTG   768
CCC TGG TTT GCC AGC TTC ATG TCC AGC TAC ATG TCG ATG TGG CAG TTT   816
GCG CGC CTC GCA TGG TGG ACG GTG GTC ATG CAG CTG CTG GGT GCG CCA   864
ATG GCG AAC CTG CTG GTG TTC ATG GCG GCC GCG CCC ATC CTG TCC GCC   912
TTC CGC TTG TTC TAC TTT GGC ACG TAC ATG CCC CAC AAG CCT GAG CCT   960
GGC GCC GCG TCA GGC TCT TCA CCA GCC GTC ATG AAC TGG TGG AAG TCG  1008
CGC ACT AGC CAG GCG TCC GAC CTG GTC AGC TTT CTG ACC TGC TAC CAC  1056
TTC GAC CTG CAC TGG GAG CAC CAC CGC TGG CCC TTC GCC CCC TGG TGG  1104
GAG CTG CCC AAC TGC CGC CGC CTG TCT GGC CGA GGT CTG GTT CCT GCC  1152
TAG CTG GAC ACA CTG CAG TGG GCC CTG CTG CCA GCT GGG CAT GCA GGT  1200
TGT GGC AGG ACT GGG TGA GGT GAA AAG CTG CAG GCG CTG CTG CCG GAC  1248
ACG CTG CAT GGG CTA CCC TGT GTA GCT GCC GCC ACT AGG GGA GGG GGT  1296
TTG TAG CTG TCG AGC TTG CCC CAT GGA TGA AGC TGT GTA GTG GTG CAG  1344
GGA GTA CAC CCA CAG GCC AAC ACC CTT GCA GGA GAT GTC TTG CGT CGG  1392
GAG GAG TGT TGG GCA GTG TAG ATG CTA TGA TTG TAT CTT AAT GCT GAA  1440
GCC TTT AGG GGA GCG ACA CTT AGT GCT GGG CAG GCA ACG CCC TGC AAG  1488
GTG CAG GCA CAA GCT AGG CTG GAC GAG GAC TCG GTG GCA GGC AGG TGA  1536
AGA GGT GCG GGA GGG TGG TGC CAC ACC CAC TGG GCA AGA CCA TGC TGC  1584
AAT GCT GGC GGT GTG GCA GTG AGA GCT GCG TGA TTA ACT GGG CTA TGG  1632
ATT GTT TGA GCA GTC TCA CTT ATT CTT TGA TAT AGA TAC TGG TCA GGC  1680
AGG TCA GGA GAG TGA GTA TGA ACA AGT TGA GAG GTG GTG CGC TGC CCC  1728
TGC GCT TAT GAA GCT GTA ACA ATA AAG TGG TTC

(2) INFORMATION FOR SEQ ID NO:2:

    (i) SEQUENCE CHARACTERISTICS:
         (A) LENGTH: 1771 bases
         (B) TYPE: nucleic acid
         (C) STRANDEDNESS: single
         (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GGC ACG AGC UUG CAC GCA AGU CAG CGC GCG CAA GUC AAC ACC UGC CGG    48
UCC ACA GCC UCA AAU AAU AAA GAG CUC AAG CGU UUG UGC GCC UCG ACG    96
UGG CCA GUC UGC ACU GCC UUG AAC CCG CGA GUC UCC CGC CGC ACU GAC   144
UGC CAU AGC ACA GCU AGA CGA AUG CAG CUA GCA GCG ACA GUA AUG UUG   192
GAG CAG CUU ACC GGA AGC GCU GAG GCA CUC AAG GAG AAG GAG AAG GAG   240
GUU GCA GGC AGC UCU GAC GUG UUG CGU ACA UGG GCG ACC CAG UAC UCG   288
CUU CCG UCA GAA GAG UCA GAC GCG GCC CGC CCG GGA CUG AAG AAU GCC   336
UAC AAG CCA CCA CCU UCC GAC ACA AAG GGC AUC ACA AUG GCG CUA CGU   384
GUC AUC GGC UCC UGG GCC GCA GUG UUC CUC CAC GCC AUU UUU CAA AUC   432
AAG CUU CCG ACC UCC UUG GAC CAG CUG CAC UGG CUG CCC GUG UCA GAU   480
GCC ACA GCU CAG CUG GUU AGC GGC ACG AGC AGC CUG CUC GAC AUC GUC   528
GUA GUA UUC UUU GUC CUG GAG UUC CUG UAC ACA GGC CUU UUU AUC ACC   576
ACG CAU GAU GCU AUG CAU GGC ACC AUC GCC AUG AGA AAC AGG CAG CUU   624
AAU GAC UUC UUG GGC AGA GUA UGC AUC UCC UUG UAC GCC UGG UUU GAU   672
UAC AAC AUG CUG CAC CGC AAG CAU UGG GAG CAC CAC AAC CAC ACU GGC   720
GAG GUG GGC AAG GAC CCU GAC UUC CAC AGG GGA AAC CCU GGC AUU GUG   768
CCC UGG UUU GCC AGC UUC AUG UCC AGC UAC AUG UCG AUG UGG CAG UUU   816
GCG CGC CUC GCA UGG UGG ACG GUG GUC AUG CAG CUG CUG GGU GCG CCA   864
AUG GCG AAC CUG CUG GUG UUC AUG GCG GCC GCG CCC AUC CUG UCC GCC   912
UUC CGC UUG UUC UAC UUU GGC ACG UAC AUG CCC CAC AAG CCU GAG CCU   960
GGC GCC GCG UCA GGC UCU UCA CCA GCC GUC AUG AAC UGG UGG AAG UCG  1008
CGC ACU AGC CAG GCG UCC GAC CUG GUC AGC UUU CUG ACC UGC UAC CAC  1056
UUC GAC CUG CAC UGG GAG CAC CAC CGC UGG CCC UUC GCC CCC UGG UGG  1104
GAG CUG CCC AAC UGC CGC CGC CUG UCU GGC CGA GGU CUG GUU CCU GCC  1152
UAG CUG GAC ACA CUG CAG UGG GCC CUG CUG CCA GCU GGG CAU GCA GGU  1200
UGU GGC AGG ACU GGG UGA GGU GAA AAG CUG CAG GCG CUG CUG CCG GAC  1248
ACG CUG CAU GGG CUA CCC UGU GUA GCU GCC GCC ACU AGG GGA GGG GGU  1296
UUG UAG CUG UCG AGC UUG CCC CAU GGA UGA AGC UGU GUA GUG GUG CAG  1344
GGA GUA CAC CCA CAG GCC AAC ACC CUU GCA GGA GAU GUC UUG CGU CGG  1392
GAG GAG UGU UGG GCA GUG UAG AUG CUA UGA UUG UAU CUU AAU GCU GAA  1440
GCC UUU AGG GGA GCG ACA CUU AGU GCU GGG CAG GCA ACG CCC UGC AAG  1488
GUG CAG GCA CAA GCU AGG CUG GAC GAG GAC UCG GUG GCA GGC AGG UGA  1536
AGA GGU GCG GGA GGG UGG UGC CAC ACC CAC UGG GCA AGA CCA UGC UGC  1584
AAU GCU GGC GGU GUG GCA GUG AGA GCU GCG UGA UUA ACU GGG CUA UGG  1632
AUU GUU UGA GCA GUC UCA CUU AUU CUU UGA UAU AGA UAC UGG UCA GGC  1680
AGG UCA GGA GAG UGA GUA UGA ACA AGU UGA GAG GUG GUG CGC UGC CCC  1728
UGC GCU UAU GAA GCU GUA ACA AUA AAG UGG UUC
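
Comparing the listings, SEQ ID NO:2 reads as the RNA counterpart of SEQ ID NO:1, with U in place of T. The following minimal Python sketch illustrates that correspondence on the first row of each listing; the function name is an assumption made for illustration, and the sketch performs a plain sense-strand substitution rather than modelling transcription from a template strand.

```python
def transcribe(dna):
    """Return the RNA form of a sense-strand DNA sequence (T replaced by U)."""
    return dna.upper().replace("T", "U")


# First row (nucleotides 1-48) of SEQ ID NO:1 and of SEQ ID NO:2.
DNA_ROW_1 = "GGC ACG AGC TTG CAC GCA AGT CAG CGC GCG CAA GTC AAC ACC TGC CGG"
RNA_ROW_1 = "GGC ACG AGC UUG CAC GCA AGU CAG CGC GCG CAA GUC AAC ACC UGC CGG"

print(transcribe(DNA_ROW_1) == RNA_ROW_1)
```

Spaces between codons are preserved by the substitution, so rows can be compared in the grouped form used in the listing.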
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(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



GGC ACG AGC TTG CAC GCA AGT CAG CGC GCG CAA GTC AAC ACC TGC CGG     48

TCC ACA GCC TCA AAT AAT AAA GAG CTC AAG CGT TTG TGC GCC TCG ACG     96

TGG CCA GTC TGC ACT GCC TTG AAC CCG CGA GTC TCC CGC CGC ACT GAC    144

TGC CAT AGC ACA GCT AGA CGA ATG CAG CTA GCA GCG ACA GTA ATG TTG    192
                            Met Gln Leu Ala Ala Thr Val Met Leu
                                            5

GAG CAG CTT ACC GGA AGC GCT GAG GCA CTC AAG GAG AAG GAG AAG GAG    240
Glu Gln Leu Thr Gly Ser Ala Glu Ala Leu Lys Glu Lys Glu Lys Glu
10                  15                  20                  25

GTT GCA GGC AGC TCT GAC GTG TTG CGT ACA TGG GCG ACC CAG TAC TCG    288
Val Ala Gly Ser Ser Asp Val Leu Arg Thr Trp Ala Thr Gln Tyr Ser
                30                  35                  40

CTT CCG TCA GAA GAG TCA GAC GCG GCC CGC CCG GGA CTG AAG AAT GCC    336
Leu Pro Ser Glu Glu Ser Asp Ala Ala Arg Pro Gly Leu Lys Asn Ala
            45                  50                  55

TAC AAG CCA CCA CCT TCC GAC ACA AAG GGC ATC ACA ATG GCG CTA CGT    384
Tyr Lys Pro Pro Pro Ser Asp Thr Lys Gly Ile Thr Met Ala Leu Arg
        60                  65                  70

GTC ATC GGC TCC TGG GCC GCA GTG TTC CTC CAC GCC ATT TTT CAA ATC    432
Val Ile Gly Ser Trp Ala Ala Val Phe Leu His Ala Ile Phe Gln Ile
    75                  80                  85

AAG CTT CCG ACC TCC TTG GAC CAG CTG CAC TGG CTG CCC GTG TCA GAT    480
Lys Leu Pro Thr Ser Leu Asp Gln Leu His Trp Leu Pro Val Ser Asp
90                  95                  100                 105

GCC ACA GCT CAG CTG GTT AGC GGC ACG AGC AGC CTG CTC GAC ATC GTC    528
Ala Thr Ala Gln Leu Val Ser Gly Thr Ser Ser Leu Leu Asp Ile Val
                110                 115                 120

GTA GTA TTC TTT GTC CTG GAG TTC CTG TAC ACA GGC CTT TTT ATC ACC    576
Val Val Phe Phe Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe Ile Thr
            125                 130                 135

ACG CAT GAT GCT ATG CAT GGC ACC ATC GCC ATG AGA AAC AGG CAG CTT    624
Thr His Asp Ala Met His Gly Thr Ile Ala Met Arg Asn Arg Gln Leu
        140                 145                 150

AAT GAC TTC TTG GGC AGA GTA TGC ATC TCC TTG TAC GCC TGG TTT GAT    672
Asn Asp Phe Leu Gly Arg Val Cys Ile Ser Leu Tyr Ala Trp Phe Asp
    155                 160                 165

TAC AAC ATG CTG CAC CGC AAG CAT TGG GAG CAC CAC AAC CAC ACT GGC    720
Tyr Asn Met Leu His Arg Lys His Trp Glu His His Asn His Thr Gly
170                 175                 180                 185

GAG GTG GGC AAG GAC CCT GAC TTC CAC AGG GGA AAC CCT GGC ATT GTG    768
Glu Val Gly Lys Asp Pro Asp Phe His Arg Gly Asn Pro Gly Ile Val
                190                 195                 200

CCC TGG TTT GCC AGC TTC ATG TCC AGC TAC ATG TCG ATG TGG CAG TTT    816
Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met Ser Met Trp Gln Phe
            205                 210                 215

GCG CGC CTC GCA TGG TGG ACG GTG GTC ATG CAG CTG CTG GGT GCG CCA    864
Ala Arg Leu Ala Trp Trp Thr Val Val Met Gln Leu Leu Gly Ala Pro
        220                 225                 230

ATG GCG AAC CTG CTG GTG TTC ATG GCG GCC GCG CCC ATC CTG TCC GCC    912
Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro Ile Leu Ser Ala
    235                 240                 245

TTC CGC TTG TTC TAC TTT GGC ACG TAC ATG CCC CAC AAG CCT GAG CCT    960
Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Met Pro His Lys Pro Glu Pro
250                 255                 260                 265

GGC GCC GCG TCA GGC TCT TCA CCA GCC GTC ATG AAC TGG TGG AAG TCG   1008
Gly Ala Ala Ser Gly Ser Ser Pro Ala Val Met Asn Trp Trp Lys Ser
                270                 275                 280

CGC ACT AGC CAG GCG TCC GAC CTG GTC AGC TTT CTG ACC TGC TAC CAC   1056
Arg Thr Ser Gln Ala Ser Asp Leu Val Ser Phe Leu Thr Cys Tyr His
            285                 290                 295

TTC GAC CTG CAC TGG GAG CAC CAC CGC TGG CCC TTC GCC CCC TGG TGG   1104
Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp
        300                 305                 310

GAG CTG CCC AAC TGC CGC CGC CTG TCT GGC CGA GGT CTG GTT CCT GCC   1152
Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala
    315                 320                 325

TAG CTG GAC ACA CTG CAG TGG GCC CTG CTG CCA GCT GGG CAT GCA GGT   1200

TGT GGC AGG ACT GGG TGA GGT GAA AAG CTG CAG GCG CTG CTG CCG GAC   1248

ACG CTG CAT GGG CTA CCC TGT GTA GCT GCC GCC ACT AGG GGA GGG GGT   1296

TTG TAG CTG TCG AGC TTG CCC CAT GGA TGA AGC TGT GTA GTG GTG CAG   1344

GGA GTA CAC CCA CAG GCC AAC ACC CTT GCA GGA GAT GTC TTG CGT CGG   1392

GAG GAG TGT TGG GCA GTG TAG ATG CTA TGA TTG TAT CTT AAT GCT GAA   1440

GCC TTT AGG GGA GCG ACA CTT AGT GCT GGG CAG GCA ACG CCC TGC AAG   1488

GTG CAG GCA CAA GCT AGG CTG GAC GAG GAC TCG GTG GCA GGC AGG TGA   1536

AGA GGT GCG GGA GGG TGG TGC CAC ACC CAC TGG GCA AGA CCA TGC TGC   1584

AAT GCT GGC GGT GTG GCA GTG AGA GCT GCG TGA TTA ACT GGG CTA TGG   1632

ATT GTT TGA GCA GTC TCA CTT ATT CTT TGA TAT AGA TAC TGG TCA GGC   1680

AGG TCA GGA GAG TGA GTA TGA ACA AGT TGA GAG GTG GTG CGC TGC CCC   1728

TGC GCT TAT GAA GCT GTA ACA ATA AAG TGG TTC                       1771
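
For illustration only, the correspondence between the coding region of SEQ ID NO:3 and the amino acid sequence listed with it can be sketched in Python. The sketch assumes the standard genetic code; the names CODON_TABLE, translate and ROW_145_192 are hypothetical helpers introduced here and are not part of the sequence listing. It translates the codons from nucleotide 166 to the end of the row ending at position 192, and counts the codons in the region 166 through 1152.

```python
# Standard genetic code with bases ordered T, C, A, G (assumed, not stated in the listing).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Nucleotides 145-192 of SEQ ID NO:3 (the row ending at position 192).
ROW_145_192 = "TGCCATAGCACAGCTAGACGAATGCAGCTAGCAGCGACAGTAATGTTG"
ROW_START = 145
ORF_START, ORF_END = 166, 1152  # 1-based, inclusive

# Slice from the ATG at nucleotide 166 to the end of the row.
orf_prefix = ROW_145_192[ORF_START - ROW_START:]
n_codons = (ORF_END - ORF_START + 1) // 3

print(translate(orf_prefix))  # one-letter codes for the first residues
print(n_codons)               # number of codons in the region 166-1152
```

Under these assumptions the codon count of the region agrees with the 329 amino acids stated for SEQ ID NO:4, and the codon TAG that follows position 1152 is a stop codon in the standard code.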



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Gln Leu Ala Ala Thr Val Met Leu
                5

Glu Gln Leu Thr Gly Ser Ala Glu Ala Leu Lys Glu Lys Glu Lys Glu
10                  15                  20                  25

Val Ala Gly Ser Ser Asp Val Leu Arg Thr Trp Ala Thr Gln Tyr Ser
                30                  35                  40

Leu Pro Ser Glu Glu Ser Asp Ala Ala Arg Pro Gly Leu Lys Asn Ala
            45                  50                  55

Tyr Lys Pro Pro Pro Ser Asp Thr Lys Gly Ile Thr Met Ala Leu Arg
        60                  65                  70

Val Ile Gly Ser Trp Ala Ala Val Phe Leu His Ala Ile Phe Gln Ile
    75                  80                  85

Lys Leu Pro Thr Ser Leu Asp Gln Leu His Trp Leu Pro Val Ser Asp
90                  95                  100                 105

Ala Thr Ala Gln Leu Val Ser Gly Thr Ser Ser Leu Leu Asp Ile Val
                110                 115                 120

Val Val Phe Phe Val Leu Glu Phe Leu Tyr Thr Gly Leu Phe Ile Thr
            125                 130                 135

Thr His Asp Ala Met His Gly Thr Ile Ala Met Arg Asn Arg Gln Leu
        140                 145                 150

Asn Asp Phe Leu Gly Arg Val Cys Ile Ser Leu Tyr Ala Trp Phe Asp
    155                 160                 165

Tyr Asn Met Leu His Arg Lys His Trp Glu His His Asn His Thr Gly
170                 175                 180                 185

Glu Val Gly Lys Asp Pro Asp Phe His Arg Gly Asn Pro Gly Ile Val
                190                 195                 200

Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met Ser Met Trp Gln Phe
            205                 210                 215

Ala Arg Leu Ala Trp Trp Thr Val Val Met Gln Leu Leu Gly Ala Pro
        220                 225                 230

Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala Pro Ile Leu Ser Ala
    235                 240                 245

Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Met Pro His Lys Pro Glu Pro
250                 255                 260                 265

Gly Ala Ala Ser Gly Ser Ser Pro Ala Val Met Asn Trp Trp Lys Ser
                270                 275                 280

Arg Thr Ser Gln Ala Ser Asp Leu Val Ser Phe Leu Thr Cys Tyr His
            285                 290                 295

Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp
        300                 305                 310

Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala
    315                 320                 325

WHAT IS CLAIMED IS: 

1. A DNA segment comprising a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in SEQ ID NO:4. 

2. A DNA segment as in claim 1, wherein said nucleotide sequence is a 
variant selected from the group of variants consisting of allelic variants, species 
variants, naturally occurring variants, man-induced variants and combinations 
thereof. 

3. A DNA segment as in claim 1, wherein said nucleotide sequence 
includes a sequence as set forth in SEQ ID NO:1. 

4. A DNA segment as in claim 1, wherein said nucleotide sequence 
includes a sequence as set forth between and including nucleotides 166 and 1152 
of SEQ ID NO:1. 

5. A DNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:1. 

6. A DNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:1. 

7. A DNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:2. 

8. A DNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:2. 

9. An RNA segment comprising a nucleotide sequence coding for a 
polypeptide having an amino acid sequence as set forth in SEQ ID NO:4. 

10. An RNA segment as in claim 9, wherein said nucleotide sequence is 
a variant selected from the group of variants consisting of allelic variants, species 
variants, naturally occurring variants, man-induced variants and combinations 
thereof. 

11. An RNA segment as in claim 9, wherein said nucleotide sequence 
includes a sequence as set forth in SEQ ID NO:2. 

12. An RNA segment as in claim 9, wherein said nucleotide sequence 
includes a sequence as set forth between and including nucleotides 166 and 1152 
of SEQ ID NO:2. 

13. An RNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:1. 

14. An RNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:1. 

15. An RNA segment comprising a nucleotide sequence hybridizing 
under high stringency conditions to a nucleic acid probe, said probe including at 
least fifteen successive nucleotides of SEQ ID NO:2. 

16. An RNA segment comprising a nucleotide sequence hybridizing 
under low-stringency conditions to a nucleic acid probe, said probe including 
nucleotides 166 through 1152 of SEQ ID NO:2. 

17. A polypeptide comprising an amino acid sequence corresponding to 
Haematococcus pluvialis crtO gene, allelic and species variants, and functional 
naturally occurring and man-induced variants thereof. 

18. A polypeptide as in claim 17, wherein said amino acid sequence is as 
set forth in SEQ ID NO:4. 

19. A polypeptide comprising an amino acid sequence homologous to 
the sequence set forth in SEQ ID NO:4. 

20. A polypeptide comprising an amino acid sequence being encoded by 
a DNA segment, said DNA segment hybridizing under low stringency conditions 
to nucleotides 166 through 1152 of SEQ ID NO:1, the polypeptide having a 
β-C-4-oxygenase activity. 

21. A recombinant vector DNA molecule comprising a DNA segment as 
in claim 2. 



22. A host comprising a recombinant vector DNA molecule as in claim 
21, said host is selected from the group consisting of a cell and an organism. 

23. A host comprising a DNA segment as in claim 2, said host is 
selected from the group consisting of a cell and an organism. 

24. A method of producing xanthophylls selected from the group 
consisting of astaxanthin, canthaxanthin, echinenone, cryptoxanthin, 
isocryptoxanthin, hydroxyechinenone, zeaxanthin, adonirubin or adonixanthin and 
combinations thereof, comprising the steps of: 

(a) providing a host as in claim 22; 

(b) providing said host with growing conditions for production of the 
xanthophylls; and 

(c) extracting the xanthophylls from said host. 

25. A method of producing xanthophylls selected from the group 
consisting of astaxanthin, canthaxanthin, echinenone, isocryptoxanthin, 
cryptoxanthin, hydroxyechinenone, zeaxanthin, adonirubin or adonixanthin and 
combinations thereof, comprising the steps of: 

(a) providing a host as in claim 23; 

(b) providing said host with growing conditions for production of the 
xanthophylls; and 

(c) extracting the xanthophylls from said host. 



26. A host as in claim 22, wherein said host is used as a food additive. 

27. A host as in claim 23, wherein said host is used as a food additive. 

28. A transgenic plant expressing a transgene coding for a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring or man-induced 
variants thereof. 

29. A transgenic plant as in claim 28, wherein said expression is highest 
in chromoplasts-containing tissues. 

30. A recombinant DNA vector comprising a first DNA segment 
encoding a polypeptide for directing a protein into plant chloroplasts or 
chromoplasts and an in frame second DNA segment encoding a polypeptide 
including an amino acid sequence corresponding to Haematococcus pluvialis crtO 
gene, allelic and species variants or functional naturally occurring and man- 
induced variants thereof. 

31. A recombinant DNA vector as in claim 30, wherein said first DNA 
segment is derived from the Pds gene of tomato. 

32. A recombinant DNA vector comprising a first DNA segment 
including a promoter highly expressible in plant chloroplasts or chromoplasts- 
containing tissues and a second DNA segment encoding a polypeptide including an 
amino acid sequence corresponding to Haematococcus pluvialis crtO gene, allelic 
and species variants or functional naturally occurring and man-induced variants 
thereof. 

33. A recombinant DNA vector as in claim 30, wherein said first DNA 
segment is derived from the Pds gene of tomato. 
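
As an illustration of the coordinates recited above (a probe of at least fifteen successive nucleotides, and the region between and including nucleotides 166 and 1152), the following Python sketch converts 1-based inclusive positions to string slices and enumerates fifteen-nucleotide windows. It is a sketch under stated assumptions, not a statement about the scope of any claim: the helper names region, windows and to_rna are hypothetical, and the fragment used is nucleotides 145-192 of SEQ ID NO:3 as listed.

```python
def region(seq, start, end, offset=1):
    """Return positions start..end (1-based, inclusive) of seq.

    `offset` is the position number of the first character of seq.
    """
    return seq[start - offset:end - offset + 1]


def windows(seq, k=15):
    """Return every run of k successive nucleotides in seq."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def to_rna(dna):
    """Write a DNA sequence in RNA letters (T replaced by U)."""
    return dna.replace("T", "U")


# Nucleotides 145-192 of SEQ ID NO:3 (hypothetical variable name).
FRAGMENT = "TGCCATAGCACAGCTAGACGAATGCAGCTAGCAGCGACAGTAATGTTG"

coding_start = region(FRAGMENT, 166, 192, offset=145)
probes = windows(FRAGMENT, k=15)

print(coding_start)        # starts at the ATG at nucleotide 166
print(len(probes))         # number of 15-nucleotide windows in the fragment
print(to_rna(probes[0]))   # first window in RNA letters

# Number of 15-nucleotide windows lying wholly within positions 166-1152.
print((1152 - 166 + 1) - 15 + 1)
```

The same slicing applies to the full sequences; only the fragment is embedded here to keep the example short.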

FIG. 2-A 

FIG. 2-B 

FIG. 3 

Figure 13 